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This application claims the benefit of United States provisional patent 
application serial number 60/157,923, filed October 6, 1999. The entire content 
of this provisional patent application is incorporated herein by reference. 

The invention disclosed herein was made in part with Government 
5 support under Grant DAMD 17-94-J-4234 awarded by the Department of 
Defense and POl CA32737 awarded by the National Institutes of Health. The 
Government may have certain rights to the invention. 

FIELD OF THE INVENTION 

10 This invention relates to nucleic acid and amino acid sequences of new 

genes that are differentially expressed in cells that overexpress the Her-2/neu 
oncogene. 

BACKGROUND OF THE INVENTION 

15 Cancers of the breast and ovary are among the leading causes of death 

among women. The cumulative lifetime risk of a woman developing breast 
cancer estimated to be 1 in 9. Consequendy, understanding the origins of these 
malignancies as well as models for the identification of new diagnostic and 
therapeutic modalities is of significant interest to health care professionals. In 

20 this context, cancer cells have been shown to exhibit unique gene expression, and 
dozens of cancer-specific genetic markers, tumor antigens, have been identified. 

The human HER-2/w// (c-erbB-2) proto-oncogene encodes a 
transmembrane receptor tyrosine kinase with extensive sequence homology to 
the epidermal growth factor receptor (EGFR) (Bargmann, C. L, Hung, M. C and 

25 Weinberg, R. A. (1986) Cell, 45(5), 649-57). Amplification and/or overexpression 
of HER-2///«/ has been found in one-third of human breast and one-fifth of 
ovarian cancers ( Slamon, D. J., Clark, G. M., Wong, S. G., Levin, W. J., Ullrich, 
A. and McGuire, W. L. (1987) Science, 235(4785), 177-82Slamon, D. J., 
Godolphin, W.Jones, L. A., Holt, J. A., Wong, S. G., Keith, D. E., Levin, W. J., 

30 Stuart, S. G., Udovc, J., Ullrich, A. and et al. (1989) Science, 244(4905), 707-12). 
In addition, the HER-2/ neu alteration is associated with a poor clinical outcome 
in that women whose tumors contain it experience earlier disease relapse and 
shorter overall survival (Slamon, D. J., Godolphin, W., Jones, L. A., Holt, J. A., 
Wong, S. G„ Keith, D. E, Levin, W. J., Stuart, S. G., Udove, J., Ullrich, A. and et 
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al. (1989) Science, 244(4905), 707-12; O'Reilly, S. M, Barnes, D. M., Camplejohn, 
R. S., Bartkova, J., Gregory, W. M. and Richards, M. A. (1991) Br / Cancer, 63(3), 
444-6; Press, M. R, Pike, M. C, Chazin, V. R., Hung, G., Udove, J. A., 
Markowicz, M., Danyluk, J., Godolphin, W., Sliwkowski, M, Akita, R. and et al. 
5 (1993) Cancer Res, 53(20), 4960-70; Seshadri, R., Horsfali, D. J, Firgaira, F, 
McCaul, K., Sedur, V., Chalmers, A. H., Yeo, R., Ingram, D., Dawkins, H. and 
Hahnel, R. (1994) Int J Cancer, 56(1), 61-5). Two hypotheses potentially account 
for this prognostic association. First, HER-2/neu overexpression may be an 
epiphenomenon serving merely as a marker of aggressive breast cancers. 
10 Conversely, the alteration may be causal for the aggressive phenotype. 
Considerable circumstantial evidence now supports the latter possibility, with 
data suggesting that overexpression plays a direct causal role in the pathogenesis 
of the malignancies in which it occurs (Shih, C, Padhy, L. C, Murray, M. and 
Weinberg, R. A. (1981) Nature, 290(5803), 261-4; Di Fiore, P. P., Pierce, J. H., 
15 Kraus, M. H., Segatto, O., King, C. R. and Aaronson, S. A. (1987) Science, 
237(4811), 178-82; Hudziak, R. M., Schlessinger, J. and Ullrich, A. (1987) Prvc 
Natl Acad Sci US A, 84(20), 7159-63). 

The subtraction cloning technique termed differential hybridization, also 
known as plus/minus screening (St John, T. P. and Davis, R. W. (1979) Cell, 
20 16(2), 443-52), can be used to isolate genes which are differentially expressed in 
cells which overexpress HER-2 as compared to control cells. This approach has 
the advantage of comparing two human breast cancer cell lines which are 
identical except for HER-2/ neu overexpression allowing for a direct comparison 
of cDNAs derived from the two cell populations. As disclosed herein, using this 
25 approach, we identified a series of genes, either previously characterized or 
entirely novel, whose expression levels are altered in association with HER-2/ neu 
overexpression. The evidence suggests that these genes might be mediators of 
the HER-2 overexpressing phenotype since we have confirmed their differential 
expression not only in human ovarian cancer cell lines which overexpress HER-2 
30 but also primary breast cancer specimens containing this alteration. 

The discovery of Her-2/neu overexpression modulated proteins, and the 
polynucleotides which encode them satisfies a need in the art by providing new 
compositions which have potential in understanding and modulating disorders 
associated with cell proliferation. 
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SUMMARY OF THE INVENTION 

The present invention provides new Her-2/neu overexpression 
modulated proteins (including proteins having both new and known amino acid 
5 sequences such as novel splice variants of known proteins) hereinafter designated 
HOMPS. A first HOMPS protein is designated H17. The expression of H17 
increases in cells which overexpress Her-2/neu. A second HOMPS protein is 
designated C40. The expression of C40 decreases in cells which overexpress 
Her-2/neu. A third HOMPS protein is designated H41. The expression of H41 

10 increases in cells which overexpress Her-2/neu. A fourth HOMPS protein is 
designated HI 3. The expression of HI 3 increases in cells which overexpress 
Her-2/neu. A fifth HOMPS protein is designated H14. The expression of H14 
increases in cells which overexpress Her-2/neu. In addition, the present 
invention discloses a HOMPS related polynucleotide sequence designated H63. 

15 The expression of H63 increases in cells which overexpress Her-2/neu. 

The invention provides polynucleotides corresponding or complementary 
to all or part of the HOMPS genes, mRNAs, and/or coding sequences, preferably 
in isolated form, including polynucleotides encoding HOMPS proteins and 
fragments thereof, DNA, RNA, DNA/RNA hybrid, and related molecules, 

20 polynucleotides or oligonucleotides complementary to the HOMPS genes or 
mRNA sequences or parts thereof, and polynucleotides or oligonucleotides that 
hybridize to the HOMPS genes, mRNAs, or to HOMPS-encoding 
polynucleotides. Also provided are means for . isolating cDNAs and the genes 
encoding HOMPS. Recombinant DNA molecules containing HOMPS 

25 polynucleotides, cells transformed or transduced with such molecules, and host- 
vector systems for the expression of HOMPS gene products are also provided 
The invention further provides HOMPS proteins and polypeptide fragments 
thereof. The invention further provides antibodies that bind to HOMPS proteins 
and polypeptide fragments thereof, including polyclonal and monoclonal 

30 antibodies, murine and other mammalian antibodies, chimeric antibodies, 
humanized and fully human antibodies, and antibodies labeled with a detectable 
marker. 

Accordingly, the invention provides a substantially purified HOMPS 
having the amino acid sequence shown in Figure 2, Figure 4, Figure 6, Figure 9 
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or Figure 11. A typical embodiment of the invention provides an isolated and 
substantially purified polynucleotide that encodes HOMPS. In a particular aspect, 
the polynucleotide is the nucleotide sequence shown in Figure 1, Figure 3, Figure 
5, Figure 8 or Figure 10. The invention also provides a polynucleotide sequence 
5 comprising the complement of the nucleotide sequences shown in Figure 1, 
Figure 3, Figure 5, Figure 8 or Figure 10 or variants thereof. In addition, the 
invention provides polynucleotide sequences which hybridize under stringent 
conditions to the nucleotide sequences shown in Figure 1, Figure 3, Figure 5, 
Figure 8 or Figure 10. The invention further provides nucleic acid sequences 
10 encoding fragments or the complement of the polynucleotide sequences, as well 
as expression vectors and host cells comprising polynucleotides that encode 
HOMPS. 

The invention further provides methods for detecting the presence and 
status of HOMPS polynucleotides and proteins in various biological samples, as 
15 well as methods for identifying cells that express HOMPS. A typical embodiment 
of this invention provides methods for monitoring HOMPS gene products in a 
tissue sample having or suspected of having some form of growth disregulation 
such as cancer. 

BRIEF DESCRIPTION OF THE FIGURES 
20 Figure 1 shows a nucleic acid sequence of H17 (SEQ ID NO: 1). 

Figure 2 shows an amino acid sequence of HI 7 (SEQ ID NO: 2). 
Figure 3 shows a nucleic acid sequence of C40 (SEQ ID NO: 3). 
; Figure 4 shows an amino acid sequence of C40 (SEQ ID NO: 4). 
Figure 5 shows a nucleic acid sequence of H41 (SEQ ID NO: 5). 
25 Figure 6 shows an amino acid sequence of H41 (SEQ ID NO: 6). 

Figure 7 shows a nucleic acid sequence of H63 (SEQ ID NO: 7). 
Figure 8 shows an nucleic acid sequence of HI 3 (SEQ ID NO: 8). 
Figure 9 shows an arnino acid sequence of HI 3 (SEQ ID NO: 9). 
Figure 10 shows a nucleic acid sequence of H14 (SEQ ID NO: 10). 
30 Figure 11 shows an amino acid sequence of H14 (SEQ ID NO: 1 1). 

Figure 12 shows a Northern blot analysis of candidate gene expression. 
Differential expression patterns were confirmed by Northern blot analyses for 5 
C clones and 11 H clones. For clones H35 and H45, 20 ug of total RNA was 
loaded in each lane. For the remaining clones, 2 ug of poly (A) RNA was 
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loaded ( C = MCF-7/control mRNA; H = MCF-7/HER-2 mRNA. ) Ethidium 
bromide staining of RNA gel is shown below autoradiograms to illustrate equal 
loading and quality of RNA. The size of the differentially expressed transcript is 
indicated on the left. 

5 Figure 13 shows an in vitro transcription / translation of the proteins 

from the identified differentially expressed transcripts. The transcription-coupled 
translation reaction was performed using T3 RNA polymerase, rabbit reticulocyte 
lysate and [ 35 S]methionine labeling. The first lane represents the 61 kDa 
luciferase protein product which was used as a positive control The C40, HI 3, 
10 H17, and H37 protein products are seen as distinct bands at 55, 30, 50, and 90 
kDa, respectively, whereas the H41 cDNA produced two faint bands at 30 kDa 
and a lower molecular weight. The protein molecular weight marker is shown on 
the left 

Figure 14 shows a schematic representation of the three differentially 

15 expressed novel genes. The thin line indicates a stretch of nucleotide sequences 
ending at poly A tail, denoted by (A) N , at the base pair number written next. The 
filled box illustrates location of the most probable open reading frame, with the 
numbers below indicating the base pair positions of start and stop codons 
respectively. A, Map of the C40 cDNA, the leucine zipper motif is denoted by 

20 LZ and shown above is the corresponding amino acid positions and sequences.- 
B, Map of the HI 7 cDNA, the asterisks above poly A tail indicate the presence 
of polyadenylation signals. The hatched box illustrates the amino acid region of 
shared homology and the sequence alignments are shown below the gene. 
Identical residues are indicated by shading (Z77667 (SEQ ID NO: 12) = a C 

25 elegans cDNA of unknown function, AE001086 (SEQ ID NO: 13) = sarcosine 
oxidase). Numbers at the right represent corresponding amino acid positions. 
Gaps introduced for maximal alignment are marked with dashes. C, Map of the 
H41 cDNA, NLS = nuclear localization signal, AF005858 (SEQ ID NO: 14) = 
one of the "fast evolving" drvsophila genes of unknown function. 

30 Figure 15 shows the confirmation of differential expression in CaOv-3 

ovarian cancer cells overexpressing HER-2. The differential expression patterns 
of three C clones and nine H clones identified in MCF-7 breast cancer cells were 
reproduced in CaOv-3 ovarian cancer cell counterparts on Northern blot (C = 
Control, H = HER-2 transfectant) 20 jig of total RNA was loaded in each lane. 
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Ethidium bromide staining of 18 S ribosomal RNA is shown as a loading control 
below autoradiograms. The transcript sizes are as shown in Figure 12. 

Figure 16 shows that the upregulation of the H37 and H41 transcripts 
correlates with HER-2/ neu overexpression in human breast tumors (p<0.005 and 
5 p<0.075 respectively). A, Northern blot analysis was performed to compare 
expression levels of the HER-2 vs. H37 cDNAs in 15 individual breast tumor 
samples. Ten Ug of total RNA was loaded in each lane, and the same blot was 
stripped for rehybridization with the second probe. B, The expression levels of 
the HER-2 vs. H41 cDNAs were analyzed in a separate Northern blot 

10 experiment The same set of breast tumor samples were used as in panel A 
except that the #16 tumor was substituted for the #14 due to depletion of the 
sample. Fifteen \lg of total RNA was loaded in each lane except for tumor #15 
for which only 5 were used because of lack of material. The blot was stripped 
as in A. For both A and B, ethidium bromide staining of 28 S ribosomal RNA is 

15 shown below the autoradiograms for RNA loading control. 

DESCRIPTION OF THE INVENTION 

Unless otherwise defined, all terms of art, notations and other scientific 
terminology used herein are intended to have the meanings commonly 
understood by those of skill in the art to which this invention pertains. In some 
cases, terms with commonly understood meanings are defined herein for clarity 
and/or for ready reference, and the inclusion of such definitions herein should 
not necessarily be construed to represent a substantial difference over what is 
generally understood in the art The techniques and procedures described or 
referenced herein are generally well understood and commonly employed using 
conventional methodology by those skilled in the art, such as, for example, the 
widely utilized molecular cloning methodologies described in Sambrook et al., 
1989, Molecular Cloning: A Laboratory Manual, 2d ed., Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, N.Y. As appropriate, procedures 
involving the use of commercially available kits and reagents are generally carried 
out in accordance with manufacturer defined protocols and /or parameters unless 
otherwise noted. 

In addition, a variety of art accepted definitions and methods for 
manipulating, evaluating and utilizing polypeptide and polynucleotide sequences 
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are well known in the art and are widely used as a standard practice in the field of 
biotechnology. Such common terms and practices are provided, for example in 
U.S. Patent No. 5,922,566, which is incorporated herein by reference and which 
recites a variety of the common terms and methodologies illustrated below. 
5 Before the present proteins, nucleotide sequences, and methods are 

described, it is understood that this invention is not limited to the particular 
methodology, protocols, cell lines, vectors, and reagents described as these may 
vary. It is also to be understood that the terminology used herein is for the 
purpose of describing particular embodiments only and is not intended to limit 
10 the scope of the present invention which will be limited only by the appended 
claims. 

It must be noted that as used herein and in the appended claims, the 
singular forms "a", "an", and "the" include plural reference unless the context 
clearly dictates otherwise. Thus, for example, reference to "a host cell" includes a 
15 plurality of such host cells, reference to the "antibody" is a reference to one or 
more antibodies and equivalents thereof known to those skilled in the art, and so 
forth. 

Unless defined otherwise, all technical and scientific terms used herein 
have the same meanings as commonly understood by one of ordinary skill in the 

20 art to which this invention belongs. Although any methods and materials similar 
or equivalent to those described herein can be used in the practice or testing of 
the present invention, the preferred methods, devices, and materials are now 
described. All publications mentioned herein are incorporated herein by reference 
for the purpose of describing and disclosing the cell lines, vectors, and 

25 methodologies which are reported in the publications which might be used in 
connection with the invention. Nothing herein is to be construed as an admission 
that the invention is not entitled to antedate such disclosure by virtue of prior 
invention. 

30 DEFINITIONS 

As used herein, the term "polynucleotide" means a polymeric form of 
nucleotides of at least about 10 bases or base pairs in length, either 
ribonucleotides or deoxynucleotides or a modified form of either type of 
nucleotide, and is meant to include single and double stranded forms of DNA. 
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As used herein, the term "polypeptide" means a polymer of at least about 6 
amino acids. Throughout the specification, standard three letter or single letter 
designations for amino acids are used. 

"Nucleic acid sequence", as used herein, refers to an oligonucleotide, 
5 nucleotide, or polynucleotide, and fragments or portions thereof, and to DNA or 
RNA of genomic or synthetic origin which may be single- or double-stranded, 
and represent the sense or antisense strand. Similarly, "amino acid sequence", as 
used, herein refers to an oligopeptide, peptide, polypeptide, or protein sequence, 
and fragments or portions thereof, and to naturally occurring or synthetic 
10 molecules. 

Where "amino acid sequence" is recited herein to refer to an amino acid 
sequence of a naturally occurring protein molecule, "amino acid sequence" and 
like terms, Such as "polypeptide" or "protein" are not meant to limit the amino 
acid sequence to the complete, native amino acid sequence associated with the 

15 recited protein molecule. 

"Peptide nucleic acid", as used herein, refers to a molecule which 
comprises an oligomer to which an amino acid residue, such as lysine, and an 
amino group have been added. These small molecules, also designated anti-gene 
agents, stop transcript elongation by binding to their complementary strand of 

20 nucleic acid (Nielsen, P. E. et al. (1993) Anticancer Drug Des. 8:53-63). 

HOMPS, as used herein, refers to the amino acid sequences of substantially 
purified HOMPS obtained from any species, particularly mammalian, including 
bovine, ovine, porcine, murine, equine, and preferably human, from any source 
whether natural, synthetic, semi-synthetic, or recombinant 

25 "Consensus", as used herein, refers to a nucleic acid sequence which has been 
resequenced to resolve uncalled bases, or which has been extended using the XL- 
PCR kit (Perkin Elmer, Norwalk, Conn. ) in the 5' and/or the 3' direction and 
resequenced, or which has been assembled from the overlapping sequences of 
more than one clone using the GELVIEW Fragment Assembly system (GCG, 

30 Madison, Wis. ), or which has been both extended and assembled. 

A "variant" of HOMPS, as used herein, refers to an amino acid sequence 
that is altered by one or more amino acids. The variant may have "conservative" 
changes, wherein a substituted amino acid has similar structural or chemical 
properties, e. g. , replacement of leucine with isoleucine. More rarely, a variant 
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may have "nonconservative" changes, e. g. , replacement of a glycine with a 
tryptophan. Similar minor variations may also include amino acid deletions or 
insertions, or both. Guidance in deterrriining which amino acid residues may be 
substituted, inserted, or deleted without abolishing biological or immunological 
5 activity may be found using computer programs well known in the art, for 
example, LASERGENE software. 

A "deletion", as used herein, refers to a change in either amino acid or 
nucleotide sequence in which one or more amino acid or nucleotide residues, 
respectively, are absent. 
10 An "insertion" or "addition", as used herein, refers to a change in an amino acid 
or nucleotide sequence resulting in the addition of one or more amino acid or 
nucleotide residues, respectively, as compared to the naturally occurring 
molecule. 

A "substitution", as used herein, refers to the replacement of one or more 
15 amino acids or nucleotides by different amino. acids or nucleotides, respectively. 

The term "biologically active", as used herein, refers to a protein having 
structural, regulatory, or biochemical functions of a naturally occurring molecule. 
Likewise, "immunologically active" refers to the capability of the natural, 
recombinant, or synthetic HOMPS, or any oligopeptide thereof, to induce a 
20 specific immune response in appropriate animals or cells and to bind with 
specific antibodies. 

The term "agonist", as used herein, refers to a molecule which, when 
bound to HOMPS, causes a change in HOMPS which modulates the activity of 
HOMPS. Agonists may include proteins, nucleic acids, carbohydrates, or any 
25 other molecules which bind to HOMPS. 

The terms "antagonist" or "inhibitor", as used herein, refer to a molecule 
which, when bound to HOMPS, blocks or modulates the biological or 
immunological activity of HOMPS. Antagonists and inhibitors may include 
proteins, nucleic acids, carbohydrates, or any other molecules which bind to 
30 HOMPS. 

The term "modulate", as used herein, refers to a change or an alteration 
in the biological activity of HOMPS. Modulation may be an increase or a 
decrease in protein activity, a change in binding characteristics, or any other 
change in the biological, functional or immunological properties of HOMPS. 
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The term "mimetic", as used herein, refers to a molecule, the structure of 
which is developed from knowledge of the structure of HOMPS or portions 
thereof and, as such, is able to effect some or all of the actions related to the 
human Her-2/neu ovcrexpression modulated proteins. 
5 The term "derivative", as used herein, refers to the chemical modification 

of a nucleic acid encoding HOMPS or the encoded HOMPS. Illustrative of such 
modifications would be replacement of hydrogen by an alkyl, acyl, or amino 
group. A nucleic acid derivative would encode a polypeptide which retains 
essential biological characteristics of the natural molecule. 

10 The term "substantially purified", as used herein, refers to nucleic or 

amino acid sequences that are removed from their natural environment, isolated 
or separated, and are at least 60% free, preferably 75% free, and most preferably 
90% free from other components with which they are naturally associated. 

"Amplification", as used herein, refers to the production of additional 

15 copies of a nucleic acid sequence and is generally carried out using polymerase 
chain reaction (PCR) technologies well known in the art (Dieffenbach, C. W. and 
G. S. Dveksler (1995) PCR Primer, a Laboratory Manual, Cold Spring Harbor 
Press, Plainview, N. Y. ). 

The term "hybridization", as used herein, refers to any process by which a 

20 strand of nucleic acid binds with a complementary strand through base pairing 
The term "hybridization complex", as used herein, refers to a complex formed 
between two nucleic acid sequences by virtue of the formation of hydrogen 
bonds between complementary G and C bases and between complementary A 
and T bases; these hydrogen bonds may be further stabilized by base stacking 

25 interactions. 

The two complementary nucleic acid sequences hydrogen bond in an antiparallel 
configuration. A hybridization complex may be formed in solution (e. g. C^ or 
analysis) or between one nucleic acid sequence present in solution and another 
nucleic acid sequence immobilized on a solid support (e. g. , membranes filters, 
30 chips, pins or glass slides to which cells have been fixed for in situ hybridization). 

The terms "complementary" or "complementarity", as used herein, refer 
to the natural binding of polynucleotides under permissive salt and temperature 
conditions by base-pairing. For example, the sequence "A-G-T" binds to the 
complementary sequence "T-C-A". Complementarity between two single- 
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stranded molecules may be "partial", in which only some of the nucleic acids 
bind, or it may be complete when total complementarity exists between the single 
stranded molecules. The degree of complementarity between nucleic acid strands 
has significant effects on the efficiency and. strength of hybridization between 
5 nucleic acid strands. This is of particular importance in amplification reactions, 
which depend upon binding between nucleic acids strands. 

The term "homology", as used herein, refers to a degree of 
complementarity. There may be partial homology or complete homology (i. e. , 
identity). A partially complementary sequence is one that at least partially inhibits 

10 an identical sequence from hybridizing to a target nucleic acid; it is referred to 
using the functional term "substantially homologous. " The inhibition of 
hybridization of the completely complementary sequence to the target sequence 
may be examined using a hybridization assay (Southern or northern blot, solution 
hybridization and the like) under conditions of low stringency. A substantially 

15 homologous sequence or probe will compete for and inhibit the binding (i. e. , 
the hybridization) of a completely homologous sequence or probe to the target 
sequence under conditions of low stringency. This is not to say that conditions of 
low stringency are such that non-specific binding is permitted; low stringency 
conditions require that the binding of two sequences to one another be a specific 

20 (i. e. , selective) interaction. The absence of non-specific binding may be tested by 
the use of a second target sequence which lacks even a partial degree of 
complementarity . (e. g. , less than about 30% identity); in the absence of non- 
specific binding, the probe will not hybridize to the second non-complementary 
target sequence. 

25 As used herein, the terms "hybridize", "hybridizing", "hybridizes" and 

the like, used in the context of polynucleotides, are meant to refer to 
conventional hybridization conditions, preferably such as hybridization in 50% 
formamide/6XSSC/0.1% SDS/100 u:g/ml ssDNA, in which temperatures for 
hybridization are above 37 degrees C and temperatures for washing in 0.1X 

30 SSC/0.1% SDS are above 55 degrees C, and most preferably to stringent 
hybridization conditions. 

"Stringency" of hybridization reactions is readily determinable by one of 
ordinary skill in the art, and generally is an empirical calculation dependent upon 
probe length, washing temperature, and salt concentration. In general, longer 



12 



WO 01/25250 PCT/US00/27649 

probes require higher temperatures for proper annealing, while shorter probes 
need lower temperatures. Hybridization generally depends on the ability of 
denatured DNA to reanneal when complementary strands are present in an 
environment below their melting temperature. The higher the degree of desired 
5 homology between the probe and hybridizable sequence, the higher the relative 
temperature that can be used. As a result, it follows that higher relative 
temperatures would tend to make the reaction conditions more stringent, while 
lower temperatures less so. For additional details and explanation of stringency of 
hybridization reactions, see Ausubel et al, Current Protocols in Molecular 

10 Biology, Wiley Interscience Publishers, (1995). 

"Stringent conditions" or "high stringency conditions", as defined herein, 
may be identified by those that (1) employ low ionic strength and high 
temperature for washing, for example 0.015 M sodium chloride/0.0015 M 
sodium citrate/0.1% sodium dodecyl sulfate at 50°C; (2) employ during 

15 hybridization a denaturing agent, such as formamide, for example, 50% (v/v) 
formamide with 0.1% bovine serum albumin/0.1% Ficoll/0.1% 
polyvinylpyrrolidone/50mM sodium phosphate buffer at pH 6.5 with 750 mM 
sodium chloride, 75 mM sodium citrate at 42°C; or (3) employ 50% formamide, 5 
x SSC (0.75 M NaCl, 0.075 M sodium citrate), 50 mM sodium phosphate (pH 

20 6.8), 0.1% sodium pyrophosphate, 5 x Denhardt's solution, sonicated salmon 
sperm DNA (50 ug/ml), 0.1% SDS, and 10% dextran sulfate at 42°C, with 
wishes at 42°C in 0.2 x SSC (sodium chloride/sodium, citrate) and 50% 
formamide at 55°C, followed by a high-stringency wash consisting of 0.1 x SSC 
containing EDTA at 55°C. 

25 "Moderately stringent conditions" may be identified as described by 

Sambrook et al., 1989, Molecular Cloning: A Laboratory Manual, New York: 
Cold Spring Harbor Press, and include the use of washing solution and 
hybridization conditions (e.g., temperature, ionic strength and %SDS) less 
stringent than those described above. An example of moderately stringent 

30 conditions is overnight incubation at 37°C in a solution comprising: 20% 
formamide, 5 x SSC (150 mM NaCl, 15 mM trisodium citrate), 50 mM sodium 
phosphate (pH 7.6), 5 x Denhardt's solution, 10% dextran sulfate, and 20 
mg/ mL denatured sheared salmon sperm DNA, followed by washing the filters 
in 1 x SSC at about 37-50°C The skilled artisan will recognize how to adjust the 
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temperature, ionic strength, etc, as necessary to accommodate factors such as 
probe length and the like. 

In the context of amino acid sequence comparisons, the term "identity" is 
used to express the percentage of amino acid residues at the same relative 
5 positions that are the same. Also in this context, the term "homology" is used to 
express the percentage of amino acid residues at the same relative positions that 
are either identical or are similar, using the conserved amino acid criteria of 
BLAST analysis, as is generally understood in the art. For example, % identity 
values may be generated by WU-BLAST-2 (Altschul et aL, 1996, Methods in 

10 Enzymology 266:460-480; http://blast.wusd/edu/blast/ README, html). 
Further details regarding amino acid substitutions, which are considered 
conservative under such criteria, are provided below. 

As will be understood by those of skill in the art, the stringency of 
hybridization may be altered in order to identify or detect identical or related 

15 polynucleotide sequences. 

The term "antisense", as used herein, refers to nucleotide sequences 
which are complementary to a specific DNA or RNA sequence. The term 
"antisense strand" is used in reference to a nucleic acid strand that is 
•complementary to the "sense" strand. Antisense molecules may be produced by 

20 any method, including synthesis by ligating the gene(s) of interest in a reverse 
orientation to a viral promoter which permits the synthesis of a complementary 
strand. Once introduced into a cell, this transcribed strand combines with natural 
sequences produced by the cell to form duplexes. These duplexes then block 
either the further transcription or translation. In this manner, mutant phenotypes 

25 may be generated. The designation "negative" is sometimes used in reference to 
the antisense strand, and "positive" is sometimes used in reference to the sense 
strand. 

The term "portion", as used herein, with regard to a protein (as in "a 
portion of a given protein") refers to fragments of that protein. The fragments 
30 may range in size from four amino acid residues to the entire amino acid 
sequence minus one amino acid. Thus, a protein "comprising at least a portion of 
the amino acid sequence of Figure 2, Figure 4, Figure 6, Figure 9 or Figure 11" 
encompasses the full-length human HOMPS and fragments thereof. 
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"Transformation", as defined herein, describes a process by which 
exogenous DNA enters and changes a recipient cell. It may occur under natural 
or artificial conditions using various methods well known in the art 
Transformation may rely on any known method for the insertion of foreign 
5 nucleic acid sequences into a prokaryotic or eukaryotic host cell. The method is 
selected based on the host cell being transformed and may include, but is not 
limited to, viral infection, electroporation, lipofection, and particle bombardment. 
Such "transformed" cells include stably transformed cells in which the inserted 
DNA is capable of replication either as an autonomously replicating plasmid or 

10 as part of the host chromosome. They also include cells which transiendy express 
the inserted DNA or RNA for limited periods of time. 

The term "antigenic determinant", as used herein, refers to that portion 
of a molecule that makes contact with a particular antibody (i. e. an epitope). 
When a protein or fragment of a protein is used to immunize a host animal, 

15 numerous regions of the protein may induce the production of antibodies which 
bind specifically to a given region or three-dimensional structure on the protein; 
these regions ot structures are referred to as antigenic determinants. An antigenic 
determinant may compete with the intact antigen (i. e. , the immunogen used to 
elicit the immune response) for binding to an antibody. 

20 The terms "specific binding" or "specifically binding", as used herein, in 

reference to the interaction of an antibody and a protein or peptide, mean that 
the interaction is dependent upon the presence of a particular structure (i. e., the 
antigenic determinant or epitope) on the protein; in other words, the antibody is 
recognizing and binding to a specific protein structure rather than to proteins in 

25 general. For example, if an antibody is specific for epitope "A", the presence of a 
protein containing epitope A (or free, unlabeled A) in a reaction containing 
labeled "A" and the antibody will reduce the amount of labeled A bound to the 
antibody. 

The term "sample", as used herein, is used in its broadest sense. A 
30 biological sample suspected of containing nucleic acid encoding HOMPS or 
fragments thereof may comprise a cell, chromosomes isolated from a cell (e. g. , a 
spread of metaphase chromosomes), genomic DNA (in solution or bound to a 
solid support such as for Southern analysis), RNA (in solution or bound to a 
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solid support such as for northern analysis), cDNA (in solution or bound to a 
solid support), an extract from cells or a tissue, and the like. 

The term "correlates with expression of a polynucleotide", as used herein, 
indicates that the detection of the presence of ribonucleic acid that is similar to 
5 Figure 1, Figure 3, Figure 5, Figure 8 or Figure 10 by northern analysis is 
indicative of the presence of mRNA encoding HOMPS in a sample and thereby 
correlates with expression of the transcript from the polynucleotide encoding the 
protein. 

"Alterations" in the polynucleotide of Figure 1, Figure 3, Figure 5, Figure 8 or 
10 Figure 10, as used herein, comprise any alteration in the sequence of 
polynucleotides encoding HOMPS including deletions, insertions, and point 
mutations that may be detected using hybridization assays. Included within this 
definition is the detection of alterations to the genomic DNA sequence which 
encodes HOMPS (e. g. , by alterations in the pattern of restriction fragment 
15 length polymorphisms capable of hybridizing to Figure 1, Figure 3, Figure 5, 
Figure 8 or Figure 10), the inability of a selected fragment of Figure 1, Figure 3, 
Figure 5, Figure 8 or Figure 10 to hybridize to a sample of genomic DNA (e. g. , 
using allele-specific oligonucleotide probes), and improper or unexpected 
hybridization, such as hybridization to a locus other than the normal 
20 chromosomal locus for the polynucleotide sequence encoding HOMPS (e. g. , 
using fluorescent in situ hybridization [FISH] to metaphase chromosomes 
spreads). 

As used herein, the term "antibody" refers to intact molecules as well as 
fragments thereof, such as Fab, F(ab , ) 2 and Fv, which are capable of binding the 

25 epitopic determinant. Antibodies that bind HOMPS polypeptides can be 
prepared using intact polypep tides or fragments containing small peptides of 
interest as the immunizing antigen. The polypeptide or peptide used to immunize 
an animal can be derived from the transition of RNA or synthesized chemically, 
and can be conjugated to a carrier protein, if desired. Commonly used carriers 

30 that are chemically coupled to peptides include bovine serum albumin and 
thyroglobulin. The coupled peptide is then used to immunize the animal (e. g. , a 
mouse, a rat, or a rabbit). 

The term "humanized antibody", as used herein, refers to antibody 
molecules in which amino acids have been replaced in the non-antigen binding 
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regions in order to more closely resemble a human antibody, while still retaining 
the original binding ability. 

Additional definitions are provided throughout the subsections that 

follow. 

5 

THE INVENTION 

The invention is based on the discovery of new human Her-2/neu 
overexpression modulated proteins (HOMPS), the polynucleotides encoding 
HOMPS, and the use of these compositions for the evaluation and 

10 characterization of disorders associated with abnormal cellular proliferation. 

Amplification and resulting- overexpression of the HER-2/?teu proto- 
oncogene is found in approximately 30% of human breast and 20% of human 
ovarian cancers. To better understand the molecular events associated with 
overexpression of this gene in human breast cancer cells, differential 

15 hybridization was used to identify genes whose expression levels are altered in 
cells overexpressing this receptor. As illustrated below, of 16,000 clones screened 
from an overexpression cell cDNA library, a total of 19 non-redundant clones, 
were isolated including 7 whose, expression decreases (C clones) and 12 which 
increase (H clones) in association with HER-2/rjeM overexpression. Of these, 5 C 

20 clones and 11 H clones have been confirmed to be differentially expressed by 
Northern blot analysis. This group includes nine genes of known function, three 
previously sequenced genes of relatively uncharacterized function and four novel 
genes without match in GenBank. Examination of the previously characterized 
genes indicates that they represent sequences known to be frequently associated 

25 with the malignant phenotype, suggesting that the subtraction cloning strategy 
used identified appropriate target genes. In addition, differential expression of 12 
of 16 (75%) cDNAs identified in the breast cancer cell lines are also seen in 
HER-2/neu overexpressing ovarian cancer cells, indicating that they represent 
generic associations with HER-2//?«/ overexpression. Finally, upregulation of 

30 two of the identified cDNAs, one novel and one identified but as yet 
uncharacterized gene, was confirmed in human breast cancer specimens in 
association with HER-2/ncu overexpression. Further characterization of these 
genes may yield insight into the fundamental biology and pathogenetic effects of 
HER-2/ neu overexpression in human breast and ovarian cancer cells. 
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To more directly evaluate the potential biologic role of HER-2/ 
overexpression in the pathogenesis of breast cancer, disclosed herein is an 
experimental model in which the effects of overexpression in human breast 
cancer cells can be studied (Pietras, R. J., Fendly, B. M., Chazin, V. R., Pegram, 
5 M D., Howell, S. B. and Slamon, D. J. (1994) Oncogene, 9(7), 1829-38; Pietras, R. 
J., Arboleda, J., Reese, D. M., Wongvipat, N., Pegram, M. D., Ramos, L., 
Gorman, C. M, Parker, M. G., Sliwkowski, M. X. and Slamon, D. J. (1995) 
Oncogene, 10(12), 2435-46; Pegram, M. D., Finn, R. S., Arzoo, K., Beryt, M, 
Pietras, R. J. and Slamon, D. J. (1997) Oncogene, 15(5), 537-47; Chazin, V.R. 

10 (1991). The biologic effects of HER-2/;/*// proto-oncogene overexpression, 
Chapter 2. Department of Microbiology and Immunology, University of 
California, Los Angeles). In these experiments, human MCF-7 breast cancer cells 
which express normal levels of the receptor were trans fected with a retroviral 
expression vector containing a full length cDNA encoding the human HER-2 

15 gene (Chazin, V. R., Kaleko, M., Miller, A. D. and Slamon, D. J. (1992) Oncogene, 
7(9), 1859-66). Multiple rounds of infection and sorting of the top 5% of HER-2 
overexpressing, pooled transfectants generated a stably transfected cell line, 
MCF-7/HER-2, in which HER-2 expression levels were comparable to those 
observed in human HER-2 overexpressing breast cancer specimens (Pegram, M. 

20 D., Finn, R. S., Arzoo, K., Beryt, M., Pietras, R. J. and Slamon, D. J. (1997) 
Oncogene, 15(5), 537-47). MCF-7 cells were similarly transfected using an empty 
neomycin resistance vector including multiple infections, producing the MCF- 
7/contxol cell line. The amount of HER-2/ neu protein expressed, as determined 
by quantitative Western blot analysis, was approximately 1 .62 pg/cell for MCF- 

25 7/HER-2 cells as compared to 0.36 pg/cell for MCF-7/control cells (Press, M. 
F., Pike, M. C, Chazin, V. R., Hung, G., Udove, J. A., Markowicz, M., Danyluk, 
J., Godolphin, W., Sliwkowski, M., Akita, R. and et al. (1993) Cancer Res, 53(20), 
4960-70). In vitro and in vivo studies of these engineered cells demonstrated that 
the growth characteristics of the MCF-7 /HER-2 human breast cancer cell line 

30 are significandy altered by the overexpression of HER-2/neu (Pietras, R. J., 
Fendly, B. M., Chazin, V. R., Pegram, M. D., Howell, S. B. and Slamon, D. J. 
(1994) Oncogene, 9(7), 1829-38; Pietras, R. J., Arboleda, J., Reese, D. M., 
Wongvipat, N., Pegram, M. D., Ramos, L., Gorman, C. M., Parker, M. G., 
Sliwkowski, M. X. and Slamon, D. J. (1995) Oncogene, 10(12), 2435-46; Pegram, M. 
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D., Finn, R. S., Arzoo, K., Betyt, M., Pietras, R. J. and Slamon, D. J. (1997) 
Oncogene, 15(5), 537-47; Chazin, V.R. (1991). The biologic effects of HER-2//W/ 
proto-oncogene overexpression, Chapter 2. Department of Microbiology and 
Immunology, University of California, Los Angeles; Chazin, V. R., Kaleko, M., 
5 Miller, A. D. and Slamon, D. J. (1992) Oncogene, 7(9), 1859-66). Increased cell 
proliferation was seen in the HER-2 overexpressing cell line as assessed by ^H- 
thymidine incorporation and in vitro cell proliferation assays. In addition, HER-2 
overexpression markedly improved soft agar cloning efficiency, and the cells 
exhibited increased tumorigenicity in nude mice ( Chazin, V.R. (1991). The 

10 biologic effects of HER-2/#«/ proto-oncogene overexpression, Chapter 2. 
Department of Microbiology and Immunology, University of California, Los 
Angeles; Chazin, V. R., Kaleko, M., Miller, A. D. and Slamon, D. J. (1992) 
Oncogene, 7(9), 1859-66). Together, the data confirmed that overexpression of the 
HER-2 receptor tyrosine kinase plays a role in altering the biologic behavior of 

15 human breast cancer cells. The exact molecular mechanism(s) by which this 
overexpression promotes a more aggressive phenotype of these cells, however, 
remains unknown. There are multiple potential mechanisms by which the 
observed phenotypic changes may occur. Increased amounts and/ or activation 
of this cell surface receptor may affect either the expression or function of other 

20 molecules involved in regulation of cell proliferation. Direct effects of HER-2 
overexpression on other cellular proteins can be accomplished by changes in 1) 
expression at the rnRNA transcript level,. 2) protein production at the 
translational level, or 3) protein activation / modification at the post-translational 
level. The cellular changes associated with HER-2/ neu overexpression are likely 

25 to be induced by most or all of these mechanisms. To identify those changes 
associated with differential expression of genes at the transcript level, we 
undertook a differential screening analysis. 

The subtraction cloning technique termed differential hybridization, also 
known as plus/minus screening (St John, T. P. and Davis, R. W. (1979) Cell, 

30 16(2), 443-52), was used to isolate genes which are differentially expressed in 
MCF-7/HER-2 cells as compared to MCF-7/control cells. This approach has 
the advantage of comparing two human breast cancer cell lines which are 
identical except for HER-2/ neu overexpression allowing for a direct comparison 
of cDNAs derived fitom the two cell populations. In the current study, we 
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identified a series of genes, either previously characterized or entirely novel, 
whose expression levels are altered in association with HER-2/ neu 
overexpression. It is possible that some of these genes might be mediators of the 
HER-2 overexpressing phenotype since we have confirmed their differential 
5 expression not only in human ovarian cancer cell lines which overexpress HER-2 
but also primary breast cancer specimens containing this alteration. 

The differential screening approach compared MCF-7 breast cancer cell 
lines trans fected with a human HER-2/ neu cDNA (MCF-7/HER-2) or with an 
identical empty vector (MCF-7 /control). The alternative approach of comparing 

10 two different non-engineered cell lines which are not isogenic i.e. MCF-7 and/or 
MDA-MB-231 compared against SKBR3 and/or BT-474, respectively, is 
problematic in that the presence of non-HER-2 associated genetic differences 
unique to cells derived from different individuals would almost certainly 
complicate interpretation of results. Such heterogenetic effects would confound 

15 identification of those genes which are differentially expressed in direct 
association with HER-2/ mu overexpression. A relatively conventional 
subtraction cloning method termed differential hybridization has been 
successfully used by other investigators in the cloning of genes associated with 
various biologic phenomenon including the galactose-inducible genes of yeast (St 

20 John, T. P. and Davis, R. W. (1979) Ceil, 16(2), 443-52), human fibroblast 
interferon (Taniguchi, T., Fujii-Kuriyama, Y. and Muramatsu, M. (1980) ProcNall 
Acad Set USA, 77(7), 4003-6), a variety of heat-shock proteins (Mason, I. J., 
Taylor, A., Williams, J. G., Sage, H. and Hogan, B. L. (1986) Embo /, 5(7), 1465^ 
72), and the metastasis suppressor gene nm-23 (Steeg, P. S., Bevilacqua, G., 

25 Kopper, L., Thorgeirsson, U. P., Talmadge, J. E r , Liotta, L. A. and Sobel, M. E. 
(1988) J Natl Cancer Inst, 80(3), 200-4). This screening strategy has the advantage 
of obtaining a high yield of full-length clones in contrast to more recent 
techniques such as differential display or representational difference analysis 
(RDA) which require an additional procedure of screening a cDNA library using 

30 the DNA fragments obtained. 

From our initial screen of 16,000 MCF-7/HER-2 cDNA library clones, 
we identified five genes with decreased and eleven genes with increased 
expression levels in association with HER-2/ neu overexpression. These clones 
include nine genes with previously identified cellular functions, three existing 
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sequences of relatively uncharacterized function, and four novel genes without 
matching sequences in GenBank. A number of the known genes identified in 
our screening have been previously reported to be associated with several aspects 
of human breast cancer and/or tumorigenicity in general. Although the 
5 differential screening approach does not provide direct evidence that a given gene 
plays a critical role in the phenotypic changes associated with HER-2 
overexpression, a review of the literature regarding some of the genes in our 
study indicates that they may be candidates. Recent data, for example, indicates 
that downregulation of cytokeratin (C29, C49) gene expression may result in 

10 disorganization of the cytoskeleton leading to enhanced invasive properties 
(Mukhopadhyay, T. and Roth, J. A. (1996) Anticancer Res, 16(1), 105-12). 
Similarly, the gamma actin (C72) transcript level is markedly decreased in salivary 
gland adenocarcinoma cells on acquisition of metastatic ability (Suzuki, H., 
Nagata, H., Shimada, Y. and Konno, A. (1998) Int] Oncol, 12(5), 1079-84). These 

15 observations are consistent with our findings and are of interest given the fact 
that HER-2 overexpression is associated with increased metastatic potential 
(Kennedy, M. J. (1996) Curr Opin Oncol t 8(6), 485-90; Pantel, K., Schlimok, G., 
Braun, S., Kutter, D., Lindemann, F., Schaller, G., Funke, I., Izbicki, J. R. and 
Riethmuller, G. (1993) / Natl Cancer Inst, 85(17), 1419-24; Kallioniemi, O. P., 

20 HoUi, K., Visakorpi, T., Koivula, T., Helin, H. H. and Isola, J. J. (1991) In/ J 
Cancer, 49(5), 650-5). The observation that Cathepsin D (C31) transcript level is 
decreased in HER-2 overexpressing breast cancer cells is consistent with the 
most recent clinical data (Johnson, M. D., Torri, J. A., Lippman, M. E. and 
Dickson, R. B. (1993) Cancer Res, 53(4), 873-7; Ravdin, P. M., Tandon, A. K., 

25 AUred, D. C, Clark, G. M., Fuqua, S. A., Hilsenbeck, S. H., Chamness, G. C. and 
Osborne, C. KL (1994) / Clin Oncol, 12(3), 467-74) which contradict the original 
reports of high Cathepsin D concentrations as indicative of a poorer prognosis 
(Thorpe, S. M, Rochefort, H., Garcia, M., Freiss, G., Christensen, I. J., Khalaf, 
S., Paolucci, F., Pau, B., Rasmussen, B. B. and Rose, C. (1989) Cancer Res, 49(21), 

30 6008-14). The 90 kDa heat shock protein (HI 8) forms highly stable complexes 
with the estrogen receptor and thus may play a role in mediating estrogen- 
dependent growth (Ramachandran, C, Catelli, M. G., Schneider, W. and 
Shyamala, G. (1988) Endocrinology, 123(2), 956-61; Shyamala, G., Gauthier, Y., 
Moore, S. K., Catelli, M. G. and Ullrich, S.J. (1989) Mol Cell Biol, 9(8), 3567-70). 
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Its potential role in regulating estrogen receptor activity in human breast cancer is 
interesting in light of the interactions recently described between HER-2 and the 
estrogen receptor (Carlomagno, C, Perrone, F., Gallo, C, De Laurentiis, M., 
Lauria, R., Morabito, A., Pettinato, G., Panico, L., D'Antonio, A., Bianco, A. R. 
5 and De Placido, S. (1996) / CHn Oncol, 14(10), 2702-8; Ignar-Trowbridge, D. M., 
Nelson, K. G., Bidwell, M. C, Curtis, S. W., Washburn, T. F., McLachlan, J. A. 
and Korach, K. S. (1992) Pwc Natl Acad Sci USA, 89(10), 4658-62). Other 
known genes found in our screening to be overexpressed in association with 
HER-2 overexpression, ribosomal proteins L8 (HI 6) and LLrep3(H35), GAPDH 

10 (H31), and succinyl coA transferase (H45), may be merely reflective of higher 
proliferation in HER-2 overexpressing tumors. Alternatively, differential 
expression of these genes may be more specifically linked to HER-2 
overexpression. An example of this could be the LLrep3 which was also 
identified in differential hybridization screening of a ras- trans fee ted 

15 teratocarcinoma cell line compared to isogenic cell control as increased 25-fold 
(Chiao, P. J., Shin, D. M., Sacks, P. G., Hong, W. K. and Tainsky, M. A. (1992) 
Mol Carcinog, 5(3), 219-31), however, this gene is not differentially expressed when 
comparing nontumorigenic and tumorigenic NIH 3T3 cells transformed by Ha- 
ras, N-ras, v-myc, v-mos, v-src and v-abl (Chiao, P. J., Shin, D. M., Sacks, P. G., 

20 Hong, W. K. and Tainsky, M. A. (1992) Mol Carcinog, 5(3), 219-31). 

DNA fragmentation factor (DFF) (HI 3) is also overexpressed in the 
HER-2 overexpressing cells and has recently been identified as a protein which 
functions downstream of caspase-3 during apoptosis (Liu, X., Zou, H., Slaughter, 
C. and Wang, X. (1997) Cell, 89(2), 175-84). Its exact cellular role in this process, 

25 i.e. inhibition or promotion of apoptosis, however, is as yet undefined (Mitamura, 
S., Ikawa, H., Mizuno, N., Kaziro, Y. and Itoh, H. (1998) Biochem Biophys Res 
Commun, 243(2), 480-4; Inohara, N., Koseki, T., Chen, S., Wu, X. and Nunez, G. 
(1998) Embo J, 17(9), 2526-33; Granville, D. J.Jiang, H, An, M. T., Levy, J. Gi, 
McManus, B, M. and Hunt, D. W. (1998) FEBS Lett, 422(2), 151-4). Lasdy, 

30 DRP-1 (Density regulated protein-1) (HI 4), which is also increased in association 
with HER-2 overexpression has been found to be preferentially expressed in cells 
grown at high density compared to cells at low density. Growth arrest by serum 
starvation or transforming growth factor B treatment does not however induce 
this gene's expression (Deyo, J. E., Chiao, P. J. and Tainsky, M. A. (1998) DNA 
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Cell Biol, 17(5), 437-47). Its role, if any, in the HER-2 phenotype remains to be 
determined. 

The possibility that the pattern of differential gene expression observed in 
this study is unique to a given experimental cell line rather than a generic 
5 phenomenon associated with HER-2 overexpression was also addressed. To 
verify differential expression in another cell line, we utilized CaOv-3 ovarian 
cancer cells engineered to overexpress neu. For 75% of the differentially 

expressed clones, the patterns identified in the breast cancer cells were also found 
in the human ovarian cancer cell counterparts. This consistent expression 

10 pattern, demonstrated across cell lines from two different epithelia (i.e. breast and 
ovary), suggest that the expression differences observed in our study are related 
to HER-2/«e// overexpression. In addition, we found a correlation between 
overexpression of HER-2/ w# and upregulation of the H37 and H41 genes in 
actual human breast cancer specimens. Those genes which did not yield a signal 

15 on Northern analysis likely due to rare message level are currendy being evaluated 
by a quantitative RT-PCR approach to circumvent this difficulty. Given the 
problem in assessing Northern blot analyses from whole tissue specimens 
resulting from dilutional artifacts introduced by surrounding normal cells, these 
correlations are encouraging. It is intriguing that the H37 cDNA, found to be 

20 overexpressed in HER-2 overexpressing cells in the current study and 
demonstrating convincing differential expression in actual tumor samples, is 
localized to a region of chromosome 3p21.3 alleged to contain a. putative lung 
cancer tumor suppressor gene(s) (Wei, M. H., Latif, R, Bader, S., Kashuba, V., 
Chen, J. Y., Duh, F. M., Sekido, Y., Lee, C. C, Geil, L., Kuzmin, I., Zabarovsky, 

25 E., Klein, G., Zbar, B., Minna, J. D. and Lerman, M. I. (1996) Cancer Res, 56(7), 
1487-92). Further characterization of this gene at the functional and genomic 
levels should give further insight into this phenomenon. 

The current studies indicate that HER-2/ neu overexpression induces a 
pattern of consistent genetic alterations in target human cells. We recognize that 

30 there are more sensitive techniques such as microarray chip technology now 
available for evaluating differential gene expression and plan to reanalyze these 
cell line pairs using these newer approaches. It is possible that some of the genes 
identified may in part be biologic mediators of the aggressive biologic behavior 
associated with HER-2/ neu overexpression. Future elucidation of role of these 
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genes, in particular those with as yet unknown function, in mediating malignant 
phenotype should provide further insights into the fundamental biology and 
pathogenetic effects of HER-2/ neu overexpression in human breast and ovarian 
cancer cells and suggests novel treatment strategies for patients whose tumors 
5 contain these alterations. 

USES FOR HOMPS GENES AND GENE PRODUCTS DESCRIBED 
HEREIN 

Skilled artisans understand both the great diagnostic value that known 

10 oncogenesis associated markers such as Her-2 and PSA provide in the 
monitoring of cancers in patients as well as the need for the identification of 
additional oncogenesis associated markers (see e.g. Bostwick et al., J Cell 
Biochem Suppl 1996;25:156-64 and Morote et aL, Int J Cancer 1999 Aug 
20;84(4):421-5). In particular, artisans understand that oncogenesis is a multistep 

15 process and the identification of a variety of different oncogenesis associated 
markers can be used to identify and characterize precancerous and cancerous 
syndromes earlier and more efficiendy (see e.g. Rhim et al, Cancer Res 1990 Sep 
1;50(17 Suppl):5653S-5657 ). In this context, the specific properties of the 
HOMPS proteins described herein (e.g. their modulation by a Her-2, an 

20 oncogene which plays a role significant role in a number of human cancers 
including breast cancer) includes them in the class of oncogenesis associated 
markers that can be used to evaluate and/or evaluate oncogenetic processes in 
cancers. Understandably, a number of the HOMPS proteins disclosed herein 
have been independendy identified by other artisans as oncogenesis associated 

25 markers which can be used to examine growth deregulation in conditions such as 
cancer (see e.g. Yano et al., Jpn J Cancer Res 1996 Sep;87(9):908-15 [hsp90]; 
Chou et al., Proc Nad Acad Sci U S A 1987 May ;84(9): 2575-9 [gamma actin] and 
Tonkin et al, Cancer Prev Control 1999 Apr;3(2):131-6 [cathepsin-D]). 

As disclosed herein, HOMPS gene products exhibit specific properties 

30 that are analogous to those found in a family of genes whose polynucleotides, 
polypeptides and anti-polypeptide antibodies are used in well known diagnostic 
assays directed to exarriining conditions associated with disregulated cell growth 
such as cancer. Well known members of this class include Her-2 as well as PSA, 
the archetypal markers that have been used by medical practitioners for years to 
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identify and monitor the presence of cancers such as prostate cancer (see e.g. 
Merrill et aL, J. Urol. 163(2): 503-5120 (2000); Polascik et aL, J. Urol. 
Aug;162(2):293-30.6 (1999) and Fortier et aL, J. Nat. Cancer Inst. 91(19): 1635- 
1640(1999)). A variety of other diagnostic markers are also used in this context 
5 including p53 and K-ras (see e.g. Tulchinsky et al., Int J Mol Med 1999 
Jul;4(l):99-102 and Minimoto et al, Cancer Detect Prev 2000;24(1):1-12). 
Consequendy, this disclosure of the HOMPS polynucleotides and polypeptides 
(as well as the HOMPS polynucleotide probes and anti-HOMPS antibodies used 
to identify the presence of these molecules) and their properties allows skilled 

10 artisans to utilize these molecules in methods that are analogous to those used, 
for example, in a variety of diagnostic assays directed to examining conditions 
associated with cancer. 

Typical embodiments of diagnostic methods which utilize the HOMPS 
polynucleotides, polypeptides and antibodies described herein are analogous to 

15 those methods from well established diagnostic assays which employ Her-2 and 
PSA polynucleotides, polypeptides and antibodies. For example, just as PSA 
polynucleotides are used as probes (for example in Northern analysis, see e.g. 
Sharief et al., Biochem. Mol. Biol. Int. 33(3): 567-74(1 994)) and primers (for 
example in PCR analysis, see e.g. Okegawa et al., J. Urol. 163(4): 1189-1190 

20 (2000)) to observe the presence and/or the level of PSA mRNAs in methods of 
monitoring PSA overexpression or the metastasis of prostate cancers, the 
HOMPS polynucleotides described herein can be utilized in the same way to 
evaluate or monitor the cellular growth disregulation that is associated with 
cancer. Alternatively, just as PSA polypeptides are used to generate antibodies 

25 specific for PSA which can then be used to observe the presence and/ or the level 
of PSA proteins in methods of monitoring PSA protein overexpression (see e.g. 
Stephan et al., Urology 55(4):560-3 (2000)) or the metastasis of prostate cells (see 
e.g. Alanen et aL, Pathol. Res. Pract. 192(3):233-7 (1996)), the HOMPS 
polypeptides described herein can be utilized to generate antibodies for use in 

30 detecting HOMPS overexpression as seen in cells expected of exhibiting some 
form of growth disregulation. 

Just as Her-2 and PSA polynucleotide fragments and polynucleotide 
variants are employed by skilled artisans for use in methods of monitoring this 
molecule, HOMPS polynucleotide fragments and polynucleotide variants can also 
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„ be used in an analogous manner. In particular, typical PSA polynucleotides used 
in methods of monitoring this molecule are probes or primers which consist of 
fragments of the PSA cDNA sequence. Illustrating this, primers used to PCR 
amplify a PSA polynucleotide must include less than the whole PSA sequence to 
5 function in the polymerase chain reaction. In the context of such PCR reactions, 
skilled artisans generally create a variety of different polynucleotide fragments 
that can be used as primers in order to amplify different portions of a 
polynucleotide of interest or to optimize amplification reactions (see e.g. 
Caetano-Anolles, G. Biotechniques 25(3): 472-476, 478-480 (1998); Robertson et 

10 al., Methods Mol. Biol. 98:121-154 (1998)). In addition, in order to facilitate their 
use by medical practitioners, variant polynucleotide sequences are typically used 
as primers and probes for the corresponding mRNAs in PCR and Northern 
analyses (see e.g. Sawai et al, Fetal Diagn. Ther. 1996 Nov-Dec;ll(6):407-13 and 
Current Protocols In Molecular Biology, Volume 2, Unit 2, Frederick M. Ausubul 

15 et al. eds., 1995)). Polynucleotide fragments and variants are typically useful in 
this context as long as they have the common attribute or characteristic of being 
capable of binding to a target polynucleotide sequence (e.g. the HOMPS 
polynucleotide shown in Figure 2) under conditions of high stringency. 

Just as Her-2 and PSA polypeptide fragments and polypeptide variants 

20 are employed by skilled artisans for use in methods of monitoring this molecule, 
HOMPS polypeptide fragments and polypeptide variants can also be used in an 
analogous manner. In particular, typical PSA polypeptides used in methods of 
monitoring this molecule are fragments of the PSA protein which contain an 
epitope that can be recognized by an antibody which will specifically bind to the 

25 PSA protein. This practice of using polypeptide fragments or polypeptide 
variants used to generate antibodies (such as anti-PSA antibodies) is typical in the 
art with a wide variety of systems such as fusion proteins being used by 
practitioners (see e.g. Current Protocols In Molecular Biology, Volume 2, Unit 
1 Frederick M. Ausubul et al. eds., 1995). In this context, each of the variety of 

30 epitopes in a protein of interest functions to provide the architecture upon which 
the antibody is generated. Typically, skilled artisans generally create a variety of 
different polypeptide fragments that can be used in order to generate antibodies 
specific for different portions of a polypeptide of interest (see e.g. U.S. Patent 
No. 5,840,501 and U.S. Patent No. 5,939,533). For example it may be preferable 
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to utilize a polypeptide comprising one of the HOMPS biological motifs 
discussed below. Polypeptide fragments and variants are typically useful in this 
context as long as they have the common attribute or characteristic of having an 
epitope capable of generating an antibody specific for a target polypeptide 
5 sequence (e.g. the HOMPS polypeptide shown in Figure 2). 

As shown herein, the HOMPS polynucleotides and polypeptides (as well 
as the HOMPS polynucleotide probes and anti-HOMPS antibodies used to 
identify the presence of these molecules) exhibit specific properties that can make 
them useful in examining cancerous cells or tissues. The described diagnostic 

10 assays that measures the presence of HOMPS gene products, in order to provide 
evidence of growth deregulation are particularly useful in identifying potential 
candidates for preventive measures or further monitoring, as has been done so 
successfully with Her-2 and PSA (see e.g. Scheurle et al, Anticancer Res. 2000 
May-Jun;20(3B):2091-6; Fontana et aL, Anticancer Res 1994 Sep- 

15 Oct;14(5B):2099-104 and Sahin, Adv Anat Pathol 2000 May;7 (3): 158-66). 

HOMPS EMBODIMENTS 

As disclosed herein, the invention is directed to Her-2/neu 
overexpression modulated proteins (HOMPS) genes and HOMPS gene products 

20 as well as HOMPS antibodies and assays for detecting these molecules. 
Typically, the invention encompasses HOMPS proteins as well as the 
polynucleotides which encode HOMPS proteins. Accordingly, any nucleic acid 
sequence which encodes the amino acid sequence of HOMPS can be used to 
generate recombinant molecules which express HOMPS. In a particular 

25 embodiment, the invention encompasses the polynucleotide comprising the 
nucleic acid sequence of Figure 1, Figure 3, Figure 5, Figure 8 or Figure 10. The 
invention encompasses HOMPS variants which retain biological or other 
functional activity of HOMPS. A preferred HOMPS variant is one having at least 
80%, and more preferably 90%, amino acid sequence identity to the HOMPS 

30 amino acid sequence of Figure 2, Figure 4, Figure 6, Figure 9 or Figure 11. A 
most preferred HOMPS variant is one having at least 95% amino acid sequence 
identity to the amino acid sequence of Figure 2, Figure 4, Figure 6, Figure 9 or 
Figure 1 1 . 
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It will be appreciated by those skilled in the art that as a result of the 
degeneracy of the genetic code, a multitude of nucleotide sequences encoding 
HOMPS, some bearing minimal homology to the nucleotide sequences of any 
known and naturally occurring gene, may be produced. Thus, the invention 
5 contemplates each and every possible variation of nucleotide sequence that could 
be made by selecting combinations based on possible codon choices. These 
combinations are made in accordance with the standard triplet genetic code as 
applied to the nucleotide sequence of naturally occurring HOMPS, and all such 
variations are to be considered as being specifically disclosed. 

10 Although nucleotide sequences which encode HOMPS and its variants 

are preferably capable of hybridizing to the nucleotide sequence of the naturally 
occurring HOMPS under appropriately selected conditions of stringency, it may 
be advantageous to produce nucleotide sequences encoding HOMPS or its 
derivatives possessing a substantially different codon usage. Codons may be 

15 selected to increase the rate at which expression of the peptide occurs in a 
particular prokaryotic or eukaryotic host in accordance with the frequency with 
which particular codons are utilized by the host. Other reasons for substantially 
altering the nucleotide sequence encoding HOMPS and its derivatives without 
altering the encoded amino acid sequences include the production of RNA 

20 transcripts having more desirable properties, such as a greater half-life, than 
transcripts produced from the naturally occurring sequence. 

The invention also encompasses production of DNA sequences, or 
portions thereof, which encode HOMPS and its derivatives, entirely by synthetic 
chemistry. After production, the synthetic sequence may be inserted into any of 

25 the many available expression vectors and cell systems using reagents that are 
well known in the art at the time of the filing of this application. Moreover, 
synthetic chemistry may be used to introduce mutations into a sequence encoding 
HOMPS or any portion thereof. 

Also encompassed by the invention are polynucleotide sequences that are 

30 capable of hybridizing to the claimed nucleotide sequences, and in particular, 
those shown in Figure 1, Figure 3, Figure 5, Figure 8 or Figure 10, under various 
conditions of stringency. Hybridization conditions are based on the melting 
temperature (Tm) of the nucleic acid binding complex or probe, as taught in 
Wahl, G. M. and S. L. Berger (1987; Methods Enzymol. 152:399-407) and 
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Kimmel, A. R. (1987; Methods Enzyraol. 152:507-511), and may be used at a 
defined stringency. 

Altered nucleic acid sequences encoding HOMPS which are 
encompassed by the invention include deledons, insertions, or substitutions of 
5 different nucleotides resulting in a polynucleotide that encodes the same or a 
functionally equivalent HOMPS. The encoded protein may also contain 
deletions, insertions, or substitutions of amino acid residues which produce a 
silent change and result in a functionally equivalent HOMPS. Deliberate amino 
acid substitutions may be made on the basis of similarity in polarity, charge, 

10 solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of the 
residues as long as the biological activity of HOMPS is retained. For example, 
negatively charged amino acids may include aspartic acid and glutamic acid; 
positively charged amino acids may include lysine and arginine; and amino acids 
with uncharged polar head groups having similar hydrophilicity values may 

15 include leucine, isoleucine, and valine; glycine and alanine; asparagine and 
glutarnine; serine and threonine; phenylalanine and tyrosine. 

As discussed above, the invention is directed to Her-2/neu 
overexpression modulated proteins (HOMPS). The typical embodiments of the 
invention discussed herein (e.g. polynucleotides, polypeptides, antibodies and 

20 assays for HOMPS gene products etc.) are directed to all of the HOMPS genes 
and gene products (i.e. H13, H14, H17, H41, H63 and C40). In descriptions of 
the invention provided herein, embodiments of a single HOMPS gene are used 
(for example the H41 gene) to illustrate typical embodiments of the invention 
that apply to all of the HOMPS molecules provided herein. In this context, 

25 artisans understand that the use of a single HOMPS molecule in illustrative 
typical embodiments common to all of the HOMPS molecules eliminates 
unnecessary redundancy in the description of the invention. 

One aspect of the invention provides polynucleotides corresponding or 
complementary to all or part of a HOMPS gene, mRNA, and/or coding 

30 sequence, preferably in isolated form, including polynucleotides encoding a 
HOMPS protein and fragments thereof, DNA, RNA, DNA/RNA hybrid, and 
related molecules, polynucleotides or oligonucleotides complementary to a 
HOMPS gene or mRNA sequence or a part thereof, and polynucleotides or 
oligonucleotides that hybridize to a HOMPS gene, mRNA, or to a HOMPS 
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encoding polynucleotide (collectively, "HOMPS polynucleotides"). As used 
herein, the HOMPS gene and protein is meant to include the HOMPS genes and 
proteins specifically described herein. 

One illustrative embodiment of a typical HOMPS polynucleotide is a 
5 polynucleotide having the H41 sequence shown in Figure 5 (SEQ ID NO: 5). A 
H41 polynucleotide may comprise a polynucleotide having the nucleotide 
sequence of human H41 as shown in FIG. 5 (SEQ ID NO: 5), wherein T can 
also be U; a polynucleotide that encodes ail or part of the H41 protein; a 
sequence complementary to the foregoing; or a polynucleotide fragment of any 
10 of the foregoing. Another embodiment comprises a polynucleotide that is 
capable of hybridizing under stringent hybridization conditions to the human 
H41 cDNA shown in FIG. 5 (SEQ ID NO: 5) or to a polynucleotide fragment 
thereof. 

Typical embodiments of the invention disclosed herein include H41 

15 polynucleotides containing specific portions of the H41 rriRNA sequence (and 
those which are complementary to such sequences) such as those that encode the 
protein and fragments thereof. For example, representative embodiments of the 
invention disclosed herein include: polynucleotides encoding about amino acid 1 
to about amino acid 10 of the H41 protein shown in FIG. 5 (SEQ ID NO: 5), 

20 polynucleotides encoding about amino acid 20 to about amino acid 30 of the 
H41 protein shown in FIG. 5 (SEQ ID NO: 5), polynucleotides encoding about 
amino acid 30 to about amino acid 40 of the H41 protein shown in FIG. 5 (SEQ 
ID NO: 5), polynucleotides encoding about amino acid 40 to about amino acid 
50 of the H41 protein shown in FIG. 5 (SEQ ID NO: 5), polynucleotides 

25 encoding about amino acid 50 to about amino acid 60 of the H41 protein shown 
in FIG. 5 (SEQ ID NO: 5), polynucleotides encoding about amino acid 60 to 
about amino acid 70 of the H41 protein shown in FIG. 5 (SEQ ID NO: 5), 
polynucleotides encoding about amino acid 70 to about amino acid 80 of the 
H41 protein shown in FIG. 5 (SEQ ID NO: 5), polynucleotides encoding about 

30 amino acid 80 to about amino acid 90 of the H41 protein shown in FIG. 5 (SEQ 
ID NO: 5) and polynucleotides encoding about amino acid 90 to about amino 
acid 100 of the H41 protein shown in FIG. 5 (SEQ ID NO: 5), etc. Following 
this scheme, polynucleotides encoding portions of the amino acid sequence of 
amino acids 100-258 of the H41 protein are typical embodiments of the 
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invention. Polynucleotides encoding larger portions of the H41 protein are also 
contemplated. For example polynucleotides encoding from about amino acid 1 
(or 20 or 30 or 40 etc.) to about amino acid 20, (or 30, or 40 or 50 etc.) of the 
H41 protein shown in FIG. 5 (SEQ ID NO: 5) may be generated by a variety of 
5 techniques well known in the art. 

Additional illustrative embodiments of H41 polynucleotides include 
embodiments consisting of a polynucleotide having the sequence as shown in 
FIG. 5 (SEQ ID NO: 5) from about nucleotide residue number t through about 
nucleotide residue number 500, from about nucleotide residue number 500 

10 through about nucleotide residue number 1000 and from about nucleotide 
residue number 500 through about nucleotide residue number 1000 and from 
about nucleotide residue number 1000 through about nucleotide residue number 
1500 and from about nucleotide residue number 1500 through about nucleotide 
residue number 2000 and from about nucleotide residue number 2000 through 

15 about nucleotide residue number 2500 and from about nucleotide residue 
number 2500 through about nucleotide residue number 3000 and from about 
nucleotide residue number 3000 through about nucleotide residue number 3346. 
These polynucleotide fragments can be of any size an include any portion of the 
H41 sequence as shown in FIG. 5 (SEQ ID NO: 5), for example a 

20 polynucleotide having the sequence as shown in FIG, 5 (SEQ ID NO: 5) from 
about nucleotide residue number 324 through about nucleotide residue number 
2248. 

Another aspect of the present invention provides H41 proteins and 
polypeptide fragments thereof. The H41 proteins of the invention include those 

25 specifically identified herein, as well as allelic variants, conservative substitution 
variants and homologs that can be isolated/generated and characterized without 
undue experimentation following the methods outlined below. Fusion proteins that 
combine parts of different H41 proteins or fragments thereof, as well as fusion 
proteins of a H41 protein and a heterologous polypeptide are also included. Such 

30 H41 proteins will be collectively referred to as the H41 proteins, the proteins of the 
invention, or H41. As used herein, the term "H41 polypeptide" refers to a 
polypeptide fragment or a H41 protein of at least 6 amino acids, preferably at least 
1 5 amino acids. 



31 



WO 01/25250 



PCT/US00/27649 



Specific embodiments of H41 proteins comprise a polypeptide having the 
amino acid sequence of human H41 as shown in FIG. 6 (SEQ ID NO: 6). 
Alternatively, embodiments of H41 proteins comprise variant polypeptides having 
alterations in the amino acid sequence of human H41 as shown in FIG. 6 (SEQ ID 
5 NO: 6). 

In general, naturally occurring allelic variants of human HOMPS such as 
H41 will share a high degree of structural identity and homology (e.g., 90% or more 
identity). Typically, allelic variants of the HOMPS proteins will contain conservative 
amino acid substitutions within the HOMPS sequences described herein or will 

10 contain a substitution of an amino acid from a corresponding position in a HOMPS 
homologue. One class of HOMPS allelic variants will be proteins that share a high 
degree of homology with at least a small region of a particular HOMPS amino acid 
sequence, but will further contain a radical departure from the sequence, such as a 
non-conservative substitution, truncation, insertion or frame shift 

15 Conservative amino acid substitutions can frequendy be made in a 

protein without altering either the conformation or the function of the protein. 
Such changes include substituting any of isoleucine (I), valine (V), and leucine (L) 
for any other of these hydrophobic amino acids; aspartic acid (D) for glutamic 
acid (E) and vice versa; glutarnine (Q) for asparagine (N) and vice versa; and 

20 serine (S) for threonine (I) and vice versa. Other substitutions can also be 
considered conservative, depending on the environment of the particular amino 
acid and its role in the three-dimensional structure of the protein. For example, 
gl/cine (G) and alanine (A) can frequendy be interchangeable, as can alanine (A) 
and valine (V). Methionine (M), which is relatively hydrophobic, can frequendy 

25 be interchanged with leucine and isoleucine, and sometimes with valine. Lysine 
(K) and arginine (R) are frequendy interchangeable in locations in which the 
significant feature of the amino acid residue is its charge and the differing pK's of 
these two amino acid residues are not significant Still other changes can be 
considered "conservative" in particular environments. 

30 Embodiments of the invention disclosed herein include a wide variety of 

art accepted variants of HOMPS proteins such as polypeptides having amino acid 
insertions, deletions and substitutions. HOMPS variants can be made using 
methods known in the art such as site-directed mutagenesis, alanine scanning, 
and PCR mutagenesis. Site-directed mutagenesis (Carter et a]., 1986, Nucl. Acids 
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Res. 13:4331; Zoller et al, 1987, Nucl. Acids Res. 10:6487), cassette mutagenesis 
(Wells et al., 1985, Gene 34:315), restriction selection mutagenesis (Wells et al., 
1986, Philos. Trans. R. Soc. London Ser. A, 317:415) or other known techniques 
can be performed on the cloned DNA to produce the HOMPS variant DNA. 
5 Scanning amino acid analysis can also be employed to identify one or more 
amino acids along a contiguous sequence. Among the preferred scanning amino 
acids are relatively small, neutral amino acids. Such amino acids include alanine, 
glycine, serine, and cysteine. Alanine is typically a preferred scanning amino acid 
among this group because it eliminates the side-chain beyond the beta-carbon 

10 and is less likely to alter the main-chain conformation of the variant. Alanine is 
also typically preferred because it is the most common amino acid. Further, it is 
ftequendy found in both buried and exposed positions (Creighton, The Proteins, 
(W.H. Freeman & Co., N.Y.); Chothia, 1976, J. MoL Biol, 150:1). If alanine 
substitution does not yield adequate amounts of variant, an isosteric amino acid 

15 can be used. 

As defined herein, HOMPS variants have the a^stinguishing attribute of 
having at least one epitope in common with a HOMPS protein (such as the H41 
protein having the amino acid sequence of FIG. 6 (SEQ ID NO: 6)), such that an 
antibody that specifically binds to a HOMPS variant will also specifically bind to 

20 the HOMPS protein (such as the HOMPS protein having the amino acid 
sequence of FIG. 6 (SEQ ID NO: 6)). Using H41 as an illustrative example, a 
polypeptide ceases to be a variant of the H41 protein shown in FIG. 6 (SEQ ID 
NO: 6) when it no longer contains an epitope capable of being recognized by an 
antibody that specifically binds to a H41 HOMPS protein. Those skilled in the 

25 art understand that antibodies that recognize proteins bind to epitopes of varying 
size, and a grouping of the order of about six amino acids, contiguous or not, is 
regarded as a typical number of amino acids in a minimal epitope. See e.g. 
Hebbes et al, Mol Immunol (1989) 26(9):865-73; Schwartz et al, J Immunol 
(1985) 135(4):2598-608. As there are approximately 20 arnino acids that can be 

30 included at a given position within the minimal 6 amino acid epitope, the odds of 
such an epitope occurring by chance are about 20 6 or about 1 in 64 million. 
Another specific class of HOMPS protein variants shares 90% or more identity 
with the amino acid sequence of FIG. 6 (SEQ ID NO: 6). 
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As discussed above, embodiments of the claimed invention include 
polypeptides containing less than the 258 amino acid sequence of the H41 
piotein shown in FIG. 6 (SEQ ID NO: 6) (and the polynucleotides encoding 
such polypeptides). For example, representative embodiments of the invention 
5 disclosed herein include polypeptides consisting of about amino acid 1 to about 
amino acid 10 of the H41 protein shown in FIG. 6 (SEQ ID NO: 6), 
polypeptides consisting of about amino acid 20 to about amino acid 30 of the 
H41 protein shown in FIG. 6 (SEQ ID NO: 6), polypeptides consisting of about 
amino acid 30 to about amino acid 40 of the H41 protein shown in FIG. 6 (SEQ 

10 ID NO: 6), polypeptides consisting of about amino acid 40 to about amino acid 
50 of the H41 protein shown in FIG. 6 (SEQ ID NO: 6), polypeptides consisting 
of about amino acid 50 to about amino acid 60 of the H41 protein shown in 
FIG. 6 (SEQ ID NO: 6), polypeptides consisting of about amino acid 60 to 
about amino acid 70 of the H41 protein shown in FIG. 6 (SEQ ID NO: 6), 

15 polypeptides consisting of about amino acid 70 to about amino acid 80 of the 
H41 protein shown in FIG. 6 (SEQ ID NO: 6), polypeptides consisting of about 
amino acid 80 to about amino acid 90 of the H41 protein shown in FIG. 6 (SEQ 
ID NO: 6) and polypeptides consisting of about amino acid 90 to about amino 
acid 100 of the H41 protein shown in FIG. 6 (SEQ ID NO: 6), etc. Following 

20 this scheme, polypeptides consisting of portions of the amino acid sequence of 
amino acids 100-258 of the H41 protein are typical embodiments of the 
invention. Polypeptides consisting of larger portions of the H41 protein are also 
contemplated. For example polypeptides consisting of about amino acid 1 . (or 20 
or 30 or 40 etc.) to about amino acid 20, (or 30, or 40 or 50 etc.) of the H41 

25 protein shown in FIG. 6 (SEQ ID NO: 6) may be generated by a variety of 
techniques well known in the art. 

Also included within the scope of the present invention are alleles of the 
genes encoding HOMPS. As used herein, an "allele" or "allelic sequence" is an 
alternative form of the gene which may result from at least one mutation in the 

30 nucleic acid sequence. Alleles may result in altered mRNAs or polypeptides 
whose structure or function may or may not be altered Any given gene may have 
none, one, or many allelic forms. Common mutational changes which give rise to 
alleles are generally ascribed to natural deletions, additions, or substitutions of 
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nucleotides. Each of these types of changes may occur alone, or in combination 
with the others, one or more times in a given sequence. 

Methods for DNA sequencing which are well known and generally available in 
the art may be used to practice any embodiments of the invention. The methods 
5 may employ such enzymes as the Klenow fragment of DNA polymerase I, 
SEQUENASE (US Biochemical Corp, Cleveland, Ohio), Taq polymerase (Perkin 
Elmer), thermostable T7 polymerase Amersham Pharmacia Biotech (Pisctaway 
N. J. ), or combinations of recombinant polymerases and proofreading 
exonucleases such as the ELONGASE Amplification System marketed by Life 

10 Technologies (Gaithersburg, Md. ). 

Preferably, the process is automated with machines such as the Hamilton Micro 
Lab 2200 (Hamilton, Reno, Nev. ), Peltier Thermal Cycler (PTC200; MJ 
Research, Watertown, Mass. ) and the ABI 377 DNA sequencers (Perkin Elmer). 
The nucleic acid sequences encoding HOMPS may be extended utilizing 

15 a partial nucleotide sequence and employing various methods known in the art to 
detect upstream sequences such as promoters and regulatory elements. For 
example, one method which may be employed, "restriction-site" PCR, uses 
universal primers to retrieve unknown sequence adjacent to a known locus 
(Sarkar, G. (1993) PCR Methods Applic. 2:318-322). In particular, genomic DNA 

20 is first amplified in the presence of primer to linker sequence and a primer 
specific to the known region. The amplified sequences are then subjected to a 
second round of PCR with the same linker primer and another specific primer 
internal to the first one. Products of each round of PCR are transcribed with an 
appropriate RNA polymerase and sequenced using reverse transcriptase. 

25 Inverse PCR may also be used to amplify or extend sequences using 

divergent primers based on a known region (Triglia, T. et al. (1988) Nucleic Acids 
Res. 16:8186). The primers may be designed using OLIGO 4. 06 Primer Analysis, 
software (National Biosciences Inc. , Plymouth, Minn. ), or another appropriate 
program, to be 22-30 nucleotides in length, to have a GC content of 50% or 

30 more, and to anneal to the target sequence at temperatures about 68°-72° C The 
method uses several restriction enzymes to generate a suitable fragment in the 
known region of a gene. The fragment is then circularized by intramolecular 
ligation and used as a PCR template. 
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Another method which may be used is capture PCR which involves PCR 
amplification of DNA fragments adjacent to a known sequence in human and 
yeast artificial chromosome DNA (Lagerstrom, M. et al. (1991) PCR Methods 
Applic. 1:111-119). In this method, multiple restriction enzyme digestions and 
5 ligations may also be used to place an engineered double-stranded sequence into 
an unknown portion of the DNA molecule before performing PCR. 

Another method which may be used to retrieve unknown sequences is 
that of Parker, J. D. et al. (1991; Nucleic Acids Res. 19:3055-3060). Additionally, 
one may use PCR, nested primers, and PROMOTERFINDER libraries 
10 (Clontech, Palo Alto, Calif. ) to walk genomic DNA. This process avoids the 
need to screen libraries and is useful in finding intron/exon junctions. 

When screening for full-length cDNAs, it is preferable to use libraries 
that have been size-selected to include larger cDNAs. Also, random-primed 
libraries are preferable, in that they will contain more sequences which contain 
15 the 5' regions of genes. Use of a randomly primed library may be especially 
preferable for situations in which an oligo d(T) library does not yield a full-length 
cDNA. 

Genomic libraries may be useful for extension of sequence into the 5' and 3' non- 
transcribed regulatory regions. 

20 Capillary electrophoresis systems which are commercially available may 

be used to analyze the size or confirm the nucleotide sequence of sequencing or 
PCR products. In particular, capillary sequencing may employ flowable polymers 
for electrophoretic separation, four different fluorescent dyes (one for each 
nucleotide) which are laser activated, and detection of the emitted wavelengths by 

25 a charge coupled device camera. Output/light intensity may be converted to 
electrical signal using appropriate software (e. g. GENOTYPER and 
SEQUENCE NAVIGATOR, Perkin Elmer) and the entire process from loading 
of samples to computer analysis and electronic data display may be computer 
controlled: 

30 Capillary electrophoresis is especially preferable for the sequencing of 

small pieces of DNA which might be present in limited amounts in a particular 
sample. 

In another embodiment of the invention, polynucleotide sequences or 
fragments thereof which encode HOMPS, or fusion proteins or functional 
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equivalents thereof, may be used in recombinant DNA molecules to direct 
expression of HOMPS in appropriate host cells. Due to the inherent degeneracy 
of the genetic code,, other DNA sequences which encode substantially the same 
or a functionally equivalent amino acid sequence may be produced and these 
5 sequences may be used to clone and express HOMPS. 

As will be understood by those of skill in the art, it may be advantageous 
to produce HOMPS-encoding nucleotide sequences possessing non-naturally 
occurring codons. For example, codons preferred by a particular prokaryotic or 
eukaryotic host can be selected to increase the rate of protein expression or to 

10 produce a recombinant RNA transcript having desirable properties, such as a 
half-life which is longer than that of a transcript generated from the naturally 
occurring sequence. 

The nucleotide sequences of the present invention can be engineered 
using methods generally known in the art in order to alter HOMPS encoding 

15 sequences for a variety of reasons, including but not limited to, alterations which 
modify the cloning, processing, and/or expression of the gene product. DNA 
shuffling by random fragmentation and PCR reassembly of gene fragments and 
synthetic oligonucleotides may be used to engineer the nucleotide sequences. For 
example, site-directed mutagenesis may be used to insert new restriction sites, 

20 alter glycosylation patterns, change codon preference, produce splice variants, or 
introduce mutations, and so forth. . 

In another embodiment of the invention, natural, modified, or 
recombinant nucleic acid sequences encoding HOMPS may be ligated to a 
heterologous sequence to encode a fusion protein. For example, to screen 

25 peptide libraries for inhibitors of HOMPS activity, it may be useful to encode a 
chimeric HOMPS protein that can be recognized by a commercially available 
antibody. A fusion protein may also be engineered to contain a cleavage site 
located between the HOMPS encoding sequence and the heterologous protein 
sequence, so that HOMPS may be cleaved and purified away from the 

30 heterologous moiety. 

In another embodiment, sequences encoding HOMPS may be 
synthesized, in whole or in part, using chemical methods well known in the art 
(see Caruthers, M. H. et al. (1980) Nucl. Acids Res. Symp. Ser. 215-223, Horn, T. 
et al. (1980) Nucl. Acids Res. Symp. Ser. 225-232). Alternatively, the protein itself 
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may be produced using chemical methods to synthesize the amino acid sequence 
of HOMPS, or a portion thereof. For example, peptide synthesis can be 
performed using various solid-phase techniques (Roberge, J. Y. et al. (1995) 
Science 269:202-204) and automated synthesis may be achieved, for example, 
5 using the ABI 431 A Peptide Synthesizer (Perkin Elmer). 

The newly synthesized peptide may be substantially, purified by 
preparative high performance liquid chromatography (e. g. , Creighton, T. (1983) 
Proteins, Structures and Molecular Principles, WH Freeman and Co. , New York, 
N. Y. ). The composition of the synthetic peptides may be confirmed by amino 
10 acid analysis or sequencing (e. g. , the Edman degradation procedure; Creighton, 
supra). 

Additionally, the amino acid sequence of HOMPS, or any part thereof, may be 
altered during direct synthesis and/or combined using chemical methods with 
sequences from other proteins, or any part thereof, to produce a variant 
15 polypeptide. 

In order to express a biologically active HOMPS, the nucleotide 
sequences encoding HOMPS or functional equivalents, may be inserted into 
appropriate expression vector, i. e. , a vector which contains the necessary 
elements for the transcription and translation of the inserted coding sequence. 

20 Methods which are well known to those skilled in the art may be used to 

construct expression vectors containing sequences encoding HOMPS and 
appropriate transcriptional and translational control elements. These methods 
include in vitro recombinant DNA techniques, synthetic techniques, and in vivo 
genetic recombination. Such techniques are described in Sambrook, J. et al. 

25 (1989) Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Press, 
Plainview, N. Y. , and Ausubel, F. M. et al (1989) Current Protocols in Molecular 
Biology, John Wiley & Sons, New York. N. Y. 

A variety of expression vector/host systems may be utilized to contain and 
express sequences encoding HOMPS. These include, but are not limited to, 
30 microorganisms such as bacteria transformed with recombinant bacteriophage, 
plasmid, or cosmid DNA expression vectors; yeast transformed with yeast 
expression vectors; insect cell systems infected with virus expression vectors (e. g. 
, baculovirus); plant cell systems transformed with virus expression vectors (e. g. , 
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cauliflower mosaic virus, CaMV; tobacco mosaic virus, TMV) or with bacterial 
expression vectors (e. g. , Ti or pBR322 plasmids); or animal cell systems. 

The "control elements" or "regulatory sequences" are those non- 
translated regions of the vector— enhancers, promoters, 5' and 3' untranslated 
5 regions-which interact with host cellular. proteins to carry out transcription and 
translation. Such elements may vary in their strength and specificity. 

Depending on the vector system and host utilized, any number of suitable 
transcription and translation elements, including constitutive and inducible 
promoters, may be used. For example, when cloning in bacterial systems, 

10 inducible promoters such as the hybrid lacZ promoter of the PBLUESCRIPT 
phagemid (Stratagene, Lajolla, Calif. ) or PSPORT1 plasmid (Gibco BRL) and 
the like may be used. The baculovirus polyhedrin promoter may be used in insect 
cells. Promoters or enhancers derived from the genomes of plant cells (e. g. , heat 
shock, RUBISCO; and storage protein genes) or from plant viruses , (e. g. , viral 

15 promoters or leader sequences) may be cloned into the vector. In mammalian cell 
systems, promoters from mammalian genes or from mammalian viruses are 
preferable. If it is necessary to generate a cell line that contains multiple copies of 
the sequence encoding HOMPS, vectors based on SV40 or EBV may be used 
with an appropriate selectable marker. 

20 In bacterial systems, a number of expression vectors may be selected 

depending upon the use intended for HOMPS. For example, when large 
quantities of HOMPS are needed for the induction of antibodies, vectors which 
direct high level expression of fusion proteins that are readily purified may be 
used. Such vectors include, but are not limited to, the multifunctional E. coli 

25 cloning and expression vectors such as BLUESCRIPT (Stratagene), in which the 
sequence encoding HOMPS may be ligated into the vector in frame with 
sequences for the aniino-terminal Met and the subsequent 7 residues of P- 
galactosidase so that a hybrid protein is produced; pIN vectors (Van Heeke, G. 
and S. M. Schuster (1989) J. Biol Chem. 264:5503-5509); and the like. pGEX 

30 vectors (Promega, Madison, Wis. ) may also be used to express foreign 
polypeptides as fusion proteins with glutathione S-transferase (GST). In general, 
such fusion proteins are soluble and can easily be purified from lysed cells by 
adsorption to glutathione-agarose beads followed by elution in the presence of 
free glutathione. Proteins made in such systems may be designed to include 
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heparin, thrombin, or factor XA protease cleavage sites so that the cloned 
polypeptide of interest can be released from the GST moiety at will. 

In the yeast, Saccharomyces cerevisiae, a number of vectors containing 
constitutive or inducible promoters such as alpha factor, alcohol oxidase, and 
5 PGH may be used. For reviews, see Ausubel et al. (supra) and Grant et al. (1987) 
Methods Enzymol. 153:516-544. 

In cases where plant expression vectors are used, the expression of 
sequences encoding HOMPS may be driven by any of a number of promoters. 
For example, viral promoters such as the 35S and 19S promoters of CaMV may 

10 be used alone or in combination with the omega leader sequence from TMV 
(Takamatsu, N. (1987) EMBO J. 6:307-311. Alternatively, plant promoters such 
as the small subunit of RUBISCO or heat shock promoters may be used 
(Coruzzi, G. et al. (1984) EMBO J. 3:1671-1680; Broglie, R. et al (1984) Science 
224:838-843; and Winter, J. et al. (1991) Results ProbL Cell Differ. 17:85-105). 

15 These constructs can be introduced into plant cells by direct DNA 
transformation or pathogen-mediated transfection. Such techniques are described 
in a number of generally available reviews (see, for example, Hobbs, S. or Murry, 
L. E. in McGraw Hill Yearbook of Science and Technology (1992) McGraw Hill, 
New York, N. Y. ; pp. 191-196). 

20 An insect system may also be used to express HOMPS. For example, in 

one such system, Autographa californica nuclear polyhedrosis virus (AcNPV) is 
used as a vector to express foreign genes in Spodoptera frugiperda cells or in 
Trichoplusia larvae. The sequences encoding HOMPS may be cloned into a non- 
essential region of the virus, such as the polyhedrin gene, and placed under 

25 control of the polyhedrin promoter. Successful insertion of HOMPS will render 
the polyhedrin gene inactive and produce recombinant virus lacking coat protein. 
The recombinant viruses may then be used to infect, for example, S. frugiperda 
cells or Trichoplusia larvae in which HOMPS may be expressed (Engelhard, E. 
K. et al. (1994) Proc. Nat Acad. Sci. 91 :3224-3227). 

30 In mammalian host cells, a number of viral-based expression systems may 

be utilized. In cases where an adenovirus is used as an expression vector, 
sequences encoding HOMPS may be ligated into an adenovirus 
transcription/ translation complex consisting of the late promoter and tripartite 
leader sequence. Insertion in a non-essential El or E3 region of the viral genome 
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may be used to obtain a viable virus which is capable of expressing HOMPS in 
infected host cells (Logan, J. and Shenk, T. (1984) Proc. Nad. Acad. Sci. 81:3655- 
3659). In addition, transcription enhancers, such as the Rous sarcoma virus 
(RSV) enhancer, may be used to increase expression in mammalian host cells. 
5 Specific initiation signals may also be used to achieve more efficient 

translation of sequences encoding HOMPS. Such signals include the ATG 
initiation codon and adjacent sequences. In cases where sequences encoding 
HOMPS, its initiation codon, and upstream sequences are inserted into the 
appropriate expression vector, no additional transcriptional or translational 
10 control signals may be needed. However, in cases where only coding sequence, or 
a portion thereof, is inserted, exogenous translational control signals including 
the ATG initiation codon should be provided. Furthermore, the initiation codon 
should be in the correct reading frame to ensure translation of the entire insert. 
Exogenous translational elements and initiation codons may be of various 
15 origins, both natural and synthetic. The efficiency of expression may be enhanced 
by the inclusion of enhancers which are appropriate for the particular cell system 
which is used, such as those described in the literature (Scharf, D. et al. (1994) 
Results Probl. Cell Differ. 20: 1 25-1 62). 

In addition, a host cell strain may be chosen for its ability to modulate the 
20 expression of the inserted sequences or to process the expressed protein in the 
desired fashion. Such modifications of the polypeptide include, but are not 
limited to, acetylation, carboxylation. glycosylation, phosphorylation, lipidation, 
and acyktion. Post-translational processing which cleaves a "prepro" form of the 
protein may also be used to facilitate correct insertion, folding and/or function. 
25 Different host cells such as CHO, HeLa, MDCK, HEK293, and WI38, which 
have specific cellular machinery and characteristic mechanisms for such post- 
translational activities, may be chosen to ensure the correct modification and 
processing of the foreign protein. 

For long-term, high-yield production of recombinant proteins, stable 
30 expression is preferred. For example, cell lines which stably express HOMPS may 
be transformed using expression vectors which may contain viral origins of 
replication and/ or endogenous expression elements and a selectable marker gene 
on the same or on a separate vector. Following the introduction of the vector, 
cells may be allowed to grow for 1-2 days in an enriched media before they are 
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switched to selective media. The purpose of the selectable marker is to confer 
resistance to selection, and its presence allows growth and recovery of cells which 
successfully express the introduced sequences. Resistant clones of stably 
transformed cells may be proliferated using tissue culture techniques appropriate 
5 to the cell type. 

Any number of selection systems may be used to recover transformed 
cell lines. These include, but are not limited to, the herpes simplex virus 
thymidine kinase (Wigler, M. et ah (1977) Cell 11:223-32) and adenine 
phosphoribosyltransferase (Lowy, I. et al. (1990) Cell 22:817-23) genes which can 

10 be employed in tk" or aprt" cells, respectively. Also, antimetabolite, antibiotic or 
herbicide resistance can be used as the basis for selection; for example, dhfr 
which confers resistance to methotrexate (Wigler, M. et al. (1980) Proc. Nad. 
Acad. Sci. 77:3567-70); npt, which confers resistance to the aminoglycosides, 
neomycin and G-418 (Colbere-Garapin, F. et al (1981) J. Mol. Biol. 150:1-14); 

15 and als or pat, which confer resistance to chlorsulfuron and phosphinotricin 
aceryltransferase, respectively (Murry, supra). Additional selectable genes have 
been described, for example, trpB, which allows cells to utilize indole in place of 
tryptophan, or hisD, which allows cells to utilize histinol in place of histidine 
(Hartman, S. C. and R. C. Mulligan (1988) Proc. Nad. Acad. Sci. 85:8047-51). 

20 Recendy, the use of visible markers has gained popularity with such markers as 
anthocyanins, P glucuronidase and its substrate GUS, and luciferase and its 
substrate luciferin, being widely used not only to identify trans forrnants, but also 
to quantify the amount of transient or stable protein expression attributable to a 
specific vector system (Rhodes, C A. et al. (1995) Methods Mol. BioL 55:121- 

25 131). 

Although the presence /absence of marker gene expression suggests that 
the gene of interest is also present, its presence and expression may need to be 
confirmed. For example, if the sequence encoding HOMPS is inserted within a 
marker gene sequence, recombinant cells containing sequences encoding 
30 KOMPS can be identified by the absence of marker gene function. Alternatively, 
a marker gene can be placed in tandem with a sequence encoding HOMPS under 
the control of a single promoter. Expression of the marker gene in response to 
induction or selection usually indicates expression of the tandem gene as welL 
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Alternatively, host cells which contain the nucleic acid sequence encoding 
HOMPS and express HOMPS may be identified by a variety of procedures 
known to those of skill in the art. These procedures include, but are not limited 
to, DNA-DNA or DNA-RNA hybridizations and protein bioassay or 
5 immunoassay techniques which include membrane, solution, or chip based 
technologies for the detection and/or quantification of nucleic acid or protein. 

The presence of polynucleotide sequences encoding HOMPS can be 
detected by DNA-DNA or DNA-RNA hybridization or amplification using 
probes or portions or fragments of polynucleotides encoding HOMPS. Nucleic 
N 10 acid amplification based assays involve the use of oligonucleotides or oligomers 
based on the sequences encoding HOMPS to detect trans formants containing 
DNA or RNA encoding HOMPS. As used herein "oligonucleotides" or 
"oligomers 1 ' refer to a nucleic acid sequence of at least about 10 nucleotides and 
as many as about 60 nucleotides, preferably about 15 to 30 nucleotides, and more 
15 preferably about 20-25 nucleotides, which can be used as a probe or amplimer. 

A variety of protocols for detecting and measuring the expression of 
HOMPS, using either polyclonal or monoclonal antibodies specific *for the 
protein are known in the art. Examples include enzyme-linked immunosorbent 
assay (ELISA), radioimmunoassay (RIA), and fluorescence activated cell sorting 
20 (FACS). A two-site, monoclonal-based immunoassay utilizing monoclonal 
antibodies reactive to two non-interfering epitopes on HOMPS is preferred, but 
a competitive binding assay may be employed. These and other assays are 
described, among other places, in Hampton, R. et/al (1990; Serological Methods, 
a Laboratory Manual, APS Press, St Paul. Minn. ) and Maddox, D. E. et al. (1983; 
25 J. Exp. Med. 158:1211-1216). 

A wide variety of labels and conjugation techniques are known by those 
skilled in the art and may be used in various nucleic acid and amino acid assays. 
Means for producing labeled hybridization or PCR probes for detecting 
sequences related to polynucleotides encoding HOMPS include oligolabeling, 
30 nick translation, end-labeling or PCR amplification using a labeled nucleotide. 

Alternatively, the sequences encoding HOMPS, or any portions thereof 
may be cloned into a vector for the production of an mRNA probe. Such vectors 
are known in the art, are commercially available, and may be used to synthesize 
RNA probes in vitro by addition of an appropriate RNA polymerase such as 17, 
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T3, or SP6 and labeled nucleotides. These procedures may be conducted using a 
variety of commercially available kits Amersham Pharmacia Biotech, Promega, 
and US Biochemical Corp. Suitable reporter molecules or labels, which may be 
used, include radionuclides, enzymes, fluorescent, chemiluminescent, or 
5 chromogenic agents as well as substrates, co factors, inhibitors, magnetic particles, 
and the like. 

Host cells transformed with nucleotide sequences encoding HOMPS may 
be cultured under conditions suitable for the expression and recovery of the 
protein from cell culture. The protein produced by a recombinant cell may be 

10 secreted or contained intracellularly depending on the sequence and/ or the 
vector used. As will be understood by those of skill in the art, expression vectors 
containing polynucleotides which encode HOMPS may be designed to contain 
signal sequences which direct secretion of HOMPS through a prokaryotic or 
eukaryotic cell membrane. Other recombinant constructions may be used to join 

15 sequences encoding HOMPS to nucleotide sequence encoding a polypeptide 
domain which will facilitate purification of soluble proteins. Such purification 
facilitating domains include, but are not limited to, metal chelating peptides such 
as his tidine- tryptophan modules that allow purification on immobilized metals, 
protein A domains that allow purification on immobilized immunoglobulin, and 

20 the domain utilized in the FLAGS extension/affinity purification system 
(Immunex Corp. , Seattle, Wash. ). The inclusion of cleavable linker sequences 
such as those specific for Factor XA or enterokinase (Invitrogen. San Diego, 
Calif. ) between the purification domain and HOMPS may be used to facilitate 
purification. 

25 One such expression vector provides for expression of a fusion protein 

containing HOMPS and a nucleic acid encoding 6 his tidine residues preceding a 
thioredoxin or an enterokinase cleavage site. The histidine residues facilitate 
purification on IMIAC (immobilized metal ion affinity chromatography) as 
described in Porath, J. et al. (1992, Prot Exp. Purif. 3: 263-281) while the 

30 enterokinase cleavage site provides a means for purifying HOMPS from the 
fusion protein. A discussion of vectors which contain fusion proteins is provided 
in Kroll, D. j. et aL (1993; DNA Cell Biol. 12:441-453). 

In addition to recombinant production fragments of HOMPS may be 
produced by direct peptide synthesis using solid-phase techniques (Merri field J. 
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(1963) J. Am. Chem. Soc 85:2149-2154). Protein synthesis may be performed 
using manual techniques or by automation. Automated synthesis may be 
achieved, for example, using Applied Biosys terns 431 A Peptide Synthesizer 
(Perkin Elmer). Various fragments of HOMPS may be chemically synthesized 
5 separately and combined using chemical methods to produce the full length 
molecule. 

Other specifically contemplated embodiments of the invention disclosed 
herein are genomic DNA, cDNAs, ribozymes, and antisense molecules, as well as 
nucleic acid molecules based on an alternative backbone or including alternative 

10 bases, whether derived from natural sources or synthesized. For example, 
antisense molecules can be RNAs or other molecules, including peptide nucleic 
acids (PNAs) or non-nucleic acid molecules such as phosphorothioate 
derivatives, that specifically bind DNA or RNA in a base pair-dependent manner. 
A skilled artisan can readily obtain these classes of nucleic acid molecules using the 

15 HOMPS polynucleotides and polynucleotide sequences disclosed herein. 

Antisense technology entails the administration of exogenous 
oligonucleotides that bind to a target polynucleotide located within the cells. The 
term "antisense" refers to the fact that such oligonucleotides are complementary 
to their intracellular targets, e.g., HOMPS. See for example, Jack Cohen, 1988, 

20 OLIGODEOXYNUCLEOTIDES, Antisense Inhibitors of Gene Expression, 
CRC Press; and Synthesis 1:1-5 (1988). The HOMPS antisense oligonucleotides 
of the present invention include derivatives such as S-oligonucleo tides 
(phosphorothioate derivatives or S-oligos, see, Jack Cohen, supra), which exhibit 
enhanced cancer cell growth inhibitory action. S-oligos (nucleoside 

25 phosphorothioates) are isoelectronic analogs of an oligonucleotide (O-oligo) in 
which a nonbridging oxygen atom of the phosphate group is replaced by a sulfur 
atom. The S-oligos of the present invention may be prepared by treatment of the 
corresponding O-oligos with 3H-l,2-benzodithiol-3-one- 1,1 -dioxide, which is a 
sulfur transfer reagent. See Iyer, R. P. et al, 1990, J. Org. Chem. 55:4693-4698; 

30 and Iyer, R. P. et al., 1990, J. Am. Chem. Soc. 112:1253-1254, the disclosures of 
which are fully incorporated by reference herein. Additional HOMPS antisense 
oligonucleotides of the present invention include morpholino antisense 
oligonucleotides known in the art (see e.g. Partridge et al., 1996, Antisense & 
Nucleic Acid Drug Development 6: 169-175). 
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The HOMPS antisense oligonucleotides of the present invention typically 
may be RNA or DNA that is complementary to and stably hybridizes with the 
first 100 N-terminal codons or last 100 C-terminal codons of the HOMPS 
genomic sequence or the corresponding mRNA. While absolute 
5 complementarity is not requited, high degrees of complementarity are preferred. 
Use of an oligonucleotide complementary to this region allows for the selective 
hybridization to HOMPS mRNA and not to mRNA specifying other regulatory 
subunits of protein kinase. Preferably, the HOMPS antisense oligonucleotides of 
the present invention are a 15 to 30-mer fragment of the antisense DNA 

10 molecule having a sequence that hybridizes to HOMPS mRNA. Optionally, 
HOMPS antisense oligonucleotide is a 30-mer oligonucleotide that is 
complementary to a region in the first 10 N-terminal codons and last 10 C- 
terminal codons of HOMPS. Alternatively, the antisense molecules are modified 
to employ ribozymes in the inhibition of HOMPS expression (L. A. Couture & 

15 D. T. Stinchcomb, 1996, Trends Genet 12: 510-515). 

Further specific embodiments of this aspect of the invention include 
primers and primer pairs, which allow the specific amplification of the 
polynucleotides of the invention or of any specific parts thereof, and probes that 
selectively or specifically hybridize to nucleic acid molecules of the invention or 

20 to any part thereof. Probes may be labeled with a detectable marker, such as, for 
example, a radioisotope, fluorescent compound, biolurninescent compound, a 
cheniiluminescent compound, metal chelator or enzyme. Such probes and 
primers can be used to detect the presence of a HOMPS polynucleotide in a sample 
and as a means for detecting a cell expressing a HOMPS protein. 

25 Illustrative examples of such probes include polypeptides comprising all or 

part of the human HOMPS H41 cDNA sequences shown in FIG. 5. Examples of 
primer pairs capable of specifically amplifying HOMPS mRNAs are also described 
in the Examples that follow. As will be understood by the skilled artisan, a great 
many different primers and probes may be prepared based on the sequences 

30 provided herein and used effectively to amplify and/ or detect a HOMPS mRNA. 

As used herein, a polynucleotide is said to be "isolated" when it is 
substantially separated from contaminant polynucleotides that correspond or are 
complementary to genes other than the HOMPS gene or that encode polypeptides 
other than HOMPS gene product or fragments thereof. A skilled artisan can readily 
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employ nucleic acid isolation procedures to obtain an isolated HOMPS 
polynucleotide. 

The HOMPS polynucleotides of the invention are useful for a variety of 
purposes, including but not limited to their use as probes and primers for the 
5 amplification and/or detection of the HOMPS gene(s), mRNA(s), or fragments 
thereof; as reagents for the evaluation and/or diagnosis and/or prognosis of 
cancers; as coding sequences capable of directing the expression of HOMPS 
polypeptides; as tools for modulating or inhibiting the expression of the HOMPS 
gene(s) and/or translation of the HOMPS transcript(s); and as therapeutic agents. 

10 Expression vectors derived from retroviruses, adenovirus, herpes or 

vaccinia viruses, or from various bacterial plasmids may be used for delivery of 
nucleotide sequences to the targeted organ, tissue or cell population. Methods 
which are well known to those skilled in the art can be used to construct 
recombinant vectors which will express anrisense molecules complementary to 

15 the polynucleotides of the gene encoding HOMPS. These techniques are 
described both in Sambrook et al. (supra) and in Ausubel et aL (supra). 

Genes encoding HOMPS can be turned off by transforming a cell or 
tissue with expression vectors which express high levels of a polynucleotide or 
fragment thereof which encodes HOMPS. Such constructs may be used to 

20 introduce untranslatable sense or antisense sequences into a cell. Even in the 
absence of integration into the DNA, such vectors may continue to transcribe 
RNA molecules until they are disabled by endogenous nucleases. Transient 
expression may last for a month or more with a non-replicating vector and even 
longer if appropriate replication elements are part of the vector system. 

25 As mentioned above, modifications of gene expression can be obtained 

by designing antisense molecules, DNA, RNA, or PNA, to the control regions of 
the gene encoding HOMPS, L e. the promoters, enhancers, and introns. 
Oligonucleotides derived from the transcription initiation site, e. g. , between 
positions -10 and +10 from the start site, are preferred. Similarly, inhibition can 

30 be achieved using "triple helix" base-pairing methodology. Triple helix pairing is 
useful because it causes inhibition of the ability of the double helix to open 
sufficiendy for the binding of polymerases, transcription factors, or regulatory 
molecules. Recent therapeutic advances using triplex DNA have been described 
in the literature (Gee, J. E. et al. (1994) In: Huber, B. E. and B. I. Carr, Molecular 
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and Immunologic Approaches, Futura Publishing Co. , Mt. Kisco, N. Y. ). The 
anrisense molecules may also be designed to block translation of mRNA by 
preventing the transcript from binding to ribosomes. 

Ribozymes, enzymatic RNA molecules, may also be used to catalyze the 
5 specific cleavage of RNA. The mechanism of ribozyme action involves sequence- 
specific hybridization of the ribozyme molecule to complementary target RNA; 
followed by endonucleolytic cleavage. Examples which may be used include 
engineered hammerhead motif ribozyme molecules that can specifically and 
efficiendy catalyze endonucleolytic cleavage of sequences encoding HOMPS. 

10 Specific ribozyme cleavage sites within any potential RNA target are 

initially identified by scanning the target molecule for ribozyme cleavage sites 
which include the following sequences: GUA, GUU, and GUC. Once identified, 
short RNA sequences of between 1 5 and 20 ribonucleotides corresponding to 
the region of the target gene containing the cleavage site may be evaluated for 

15 secondary structural features which may render the oligonucleotide inoperable. 
The suitability of candidate targets may also be evaluated by testing accessibility 
to hybridization with complementary oligonucleotides using ribonuclease 
protection assays. 

Antisense molecules and ribozymes of the invention may be prepared by 
20 any method known in the art for the synthesis of nucleic acid molecules. These 
include techniques for chemically synthesizing oligonucleotides such as solid 
phase phosphoramidite chemical synthesis. Alternatively, RNA molecules may be 
generated by in vitro and in vivo transcription of DNA sequences encoding 
HOMPS. 

25 Such DNA sequences may be incorporated into a wide variety of vectors 

with suitable RNA polymerase promoters such as T7 or SP6. Alternatively, these 
cDNA constructs that synthesize antisense RNA consatutively or inducibly can 
be introduced into cell lines, cells, or tissues. 

RNA molecules may be modified to increase intracellular stability and 

30 half-life. 

Possible modifications include, but are not limited to, the addition of 
flanking sequences at the 5' and/or 3' ends of the molecule or the use of 
phosphorothioate or 2* O-methyl rather than phosphodiesterase linkages within 
the backbone of the molecule. This concept is inherent in the production of 
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PNAs and can be extended in all of these molecules by the inclusion of 
nontraditional bases such as inosine, queosine, and wybutosine, as well as acetyl-, 
methyl-, thio-, and similarly modified forms of adenine, cytidine, guanine, 
thymine, and uridine which are not as easily recognized by endogenous 
5 endonucleases. 

Many methods for introducing vectors into cells or tissues are available 
and equally suitable for use in vivo, in vitro, and ex vivo. For ex vivo therapy, 
vectors may be introduced into stem cells taken from the patient and clonally 
propagated for autologous transplant back into that same patient. Delivery by 
10 transfection and by liposome injections may be achieved using methods which 
are well known in the art Where appropriate, the methods described above may 
be applied to any subject in need of such therapy, including, for example, 
mammals such as dogs, cats, cows, horses, rabbits, monkeys, and most 
preferably, humans. 

15 As discussed in detail below, in another embodiment, antibodies which 

specifically bind HOMPS may be used for the evaluation and characterization of 
conditions or diseases characterized by expression of HOMPS, or in assays to 
monitor patients being treated with HOMPS, agonists, antagonists or inhibitors. 
Hie antibodies useful for diagnostic purposes may be prepared in the same 

20 manner as those described above for therapeutics. Diagnostic assays for HOMPS 
include methods which utilize the antibody and a label to detect HOMPS in 
human body fluids or extracts of cells or tissues. The antibodies may be used 
with or without modification, and may be labeled by joining them, either 
covalehdy or non-covalendy, with a reporter molecule. A wide variety of reporter 

25 molecules which are known in the art may be used, several of which are 
described above. 

A variety of protocols including ELISA, RIA, and FACS for measuring 
HOMPS are known in the art and provide a basis for diagnosing altered or 
abnormal levels of HOMPS expression. Normal or standard values for HOMPS 
30 expression are established by combining body fluids or cell extracts taken from 
normal mammalian subjects, preferably human, with antibody to HOMPS under 
conditions suitable for complex formation. The amount of standard complex 
formation may be quantified by various methods, but preferably by photometric, 
means. Quantities of HOMPS expressed in subject samples, control and diseases 
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from biopsied tissues are compared with the standard values. Deviation between 
standard and subject values establishes the parameters for diagnosing disease. 

In another embodiment of the invention, the polynucleotides encoding 
HOMPS may be used for diagnostic purposes. The polynucleotides which may 
5 be used include oligonucleotide sequences, antisense RNA and DNA molecules, 
and PNAs. The polynucleotides may be used to detect and quantitate gene 
expression in biopsied tissues in which expression of HOMPS may be correlated 
with disease. The diagnostic assay may be used to distinguish between absence, 
presence, and excess expression of HOMPS, and to monitor regulation of 

10 HOMPS levels during therapeutic intervention. 

In one aspect, hybridization with PCR probes which are capable of 
detecting polynucleotide sequences, including genomic sequences, encoding 
HOMPS or closely related molecules, may be used to identify nucleic acid 
sequences which encode HOMPS. The specificity of the probe, whether it is 

15 made from a highly specific region, e. g. , 10 unique nucleotides in the 5' 
regulatory region, or a less specific region, e. g. , especially in the 3* coding region, 
and the stringency of the hybridization or amplification (maximal, high, 
intermediate, or low) will determine whether the probe identifies only naturally 
occurring sequences encoding HOMPS, alleles, or related sequences. 

20 Probes may also be used for the detection of related sequences, and 

should preferably contain at least 50% of the nucleotides from any of the 
HOMPS encoding sequences. The hybridization probes of the subject invention 
may be DNA or RNA and derived from the nucleotide sequence of Figure 1, 
Figure 3, Figure 5, Figure 8 or Figure 10 or from genomic sequence including 

25 promoter, enhancer elements, and introns of the naturally occurring HOMPS. 

Means for producing specific hybridization probes for DNAs encoding 
HOMPS include the cloning of nucleic acid sequences encoding HOMPS or 
HOMPS derivatives into vectors for the production of mRNA probes. Such 
vectors are known in the art, commercially available, and may be used to 

30 synthesize RNA probes in vitro by means of the addition of the appropriate 
RNA polymerases and the appropriate labeled nucleotides. Hybridization probes 
may be labeled by a variety of reporter groups, for example, radionuclides such as 
32 P or 35 S, or enzymatic labels, such as alkaline phosphatase coupled to the probe 
via avidin/biotin coupling systems, and the like. 
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Polynucleotide sequences encoding HOMPS may be used for the 
evaluation and characterization of disorders associated with the expression of 
HOMPS. Examples of such disorders include: various types of cancer such as 
breast cancer. The polynucleotide sequences encoding HOMPS may be used in 
5 Southern or northern, analysis, dot blot, or other membrane-based technologies; 
in PCR technologies; or in dip stick, pin, ELISA or chip assays utilizing fluids or 
tissues from patient biopsies to detect altered HOMPS expression. Such 
qualitative or quantitative methods are well known in the art. 

In a particular aspect, the nucleotide sequences encoding HOMPS may 

10 be useful in assays that detect activation or induction of various cancers such as 
breast cancer, particularly those mentioned above. The nucleotide sequences 
encoding HOMPS may be labeled by standard methods, and added to a fluid or 
tissue sample from a patient under conditions suitable for the formation of 
hybridization complexes. After a suitable incubation period, the sample is washed 

15 and the signal is quantitated and compared with a standard value. If the amount 
of signal in the biopsied or extracted sample is significantly altered from that of a 
comparable control sample, the nucleotide sequences have hybridized with 
nucleotide sequences in the sample, and the presence of altered levels of 
nucleotide sequences encoding HOMPS in the sample indicates the presence of 

20 the associated disease. Such assays may also be used to evaluate the efficacy of a 
particular therapeutic treatment regimen in animal studies, in clinical trials, or' in 
monitoring the treatment of an individual patient. 

In order to provide a basis for the evaluation and characterization of 
disease associated with expression of HOMPS, a normal or standard profile for 

25 expression is established. This may be accomplished by combining body fluids or 
cell extracts taken from normal subjects, either animal or human, with a 
sequence, or a fragment thereof, which encodes HOMPS, under conditions 
suitable for hybridization or amplification. Standard hybridization may be 
quantified by comparing the values obtained from normal subjects with those 

30 from an experiment where a known amount of a substantially purified 
polynucleotide is used. Standard values obtained from normal samples may be 
compared with values obtained from samples from patients who are symptomatic 
for disease. Deviation between standard and subject values is used to establish 
the presence of disease. 
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Once disease is established and a treatment protocol is initiated, 
hybridization assays may be repeated on a regular basis to evaluate whether the 
level of expression in the patient begins to approximate that which is observed in 
the normal patient The results obtained from successive assays may be used to 
5 show the efficacy of treatment over a period ranging from several days to 
months. 

With respect to cancer, the presence of a relatively high amount of 
transcript in biopsied tissue from an individual may indicate a predisposition for 
the development of the disease, or may provide a means for detecting the disease 

10 prior to the appearance of actual clinical symptoms. A more definitive evaluation 
and characterization of this type may allow health professionals to employ 
preventative measures or aggressive treatment earlier thereby preventing the 
development or further progression of the cancer. 

Additional diagnostic uses for oligonucleotides designed from the 

15 sequences encoding HOMPS may involve the use of PCR. Such oligomers may 
be chemically synthesized, generated enzymatically, or produced from a 
recombinant source. Oligomers will preferably consist of two nucleotide 
sequences, one with sense orientation (5 , ->3 t ) and another with antisense (3'<-5'), 
employed under optimized conditions for identification of a specific gene or 

20 condition. The same two oligomers, nested sets of oligomers, or even a 
degenerate pool of oligomers may be employed under less stringent conditions 
for detection and/or quantitation of closely related DNA or RNA sequences. 

Another aspect of the present invention relates to methods for detecting 
HOMPS polynucleotides and HOMPS proteins and variants thereof, as well as 

25 methods for identifying a cell that expresses HOMPS. The expression profile of 
HOMPS makes it a potential diagnostic marker for local and/or metastasized 
disease. In this context, the status of HOMPS gene products may provide 
information useful for predicting a variety of factors including susceptibility to 
advanced stage disease, rate of progression, and/or tumor aggressiveness. As 

30 discussed in detail below, the status of HOMPS gene products in patient samples 
may be analyzed by a variety protocols that are well known in the art including 
irnmuriohistochemical analysis, the variety of Northern blotting techniques 
including in situ hybridization, RT-PCR analysis (for example on laser capture 
micro-dissected samples), western blot analysis and tissue array analysis. 
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More particularly, the invention provides assays for the detection of 
HOMPS polynucleotides in a biological sample, such as breast or uterine tissue, 
serum, bone, prostate, and other tissues, urine, semen, cell preparations, and the 
like. Detectable HOMPS polynucleotides include, for example, a HOMPS gene or 
5 fragments thereof, HOMPS mRNA, alternative, splice variant HOMPS mRNAs, 
and recombinant DNA or RNA molecules containing a HOMPS polynucleotide. A 
number of methods for amplifying and/or detecting the presence of HOMPS 
polynucleotides are well known in the art and may be employed in the practice of 
this aspect of the invention. 

10 In one embodiment, a method for detecting a HOMPS mRNA in a 

biological sample comprises producing cDNA from the sample by reverse 
transcription using at least one primer; amplifying the cDNA so produced using a 
HOMPS polynucleotides as sense and antisense primers to amplify HOMPS 
cDNAs therein; and detecting the presence of the amplified HOMPS cDNA. 

15 Optionally, the sequence of the amplified HOMPS cDNA can be determined. In 
another embodiment, a method of detecting a HOMPS gene in a biological 
sample comprises first isolating genomic DNA from the sample; amplifying the 
isolated genomic DNA using HOMPS polynucleotides as sense and antisense 
primers to amplify the HOMPS gene therein; and detecting the presence of the 

20 amplified HOMPS gene. Any number of appropriate sense and antisense probe 
combinations may be designed from the nucleotide sequences provided for the 
HOMPS (e.g. H41 as shown in FIG. 6) and used for this purpose. 

The invention also provides assays for detecting the presence of a HOMPS 
protein in a tissue of other biological sample such as breast or uterine tissue, serum, 

25 bone, prostate, and other tissues, urine, semen, cell preparations, and the like. 
Methods for detecting a HOMPS protein are also well known and include, for 
example, irnmunoprecipitation, immunohistochemical analysis, Western Blot 
analysis, molecular binding assays, ELISA, ELIFA and the like. For example, in 
one embodiment, a method of detecting the presence of a HOMPS protein in a 

30 biological sample comprises first contacting the sample with a HOMPS antibody, 
a HOMPS-reactive fragment thereof, or a recombinant protein containing an 
antigen binding region of a HOMPS antibody; and then detecting the binding of 
HOMPS protein in the sample thereto. 
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Methods for identifying a cell that expresses HOMPS are also provided. In 
one embodiment, an assay for identifying a cell that expresses a HOMPS gene 
comprises detecting the presence of HOMPS rnRNA in the cell. Methods for the 
detection of particular mRNAs in cells are well known and include, for example, 
5 hybridization assays using complementary DNA probes (such as in situ 
hybridization using labeled HOMPS riboprobes, Northern blot and related 
techniques) and various nucleic acid amplification assays (such as RT-PCR using 
complementary primers specific for HOMPS, and other amplification type 
detection methods, such as, for example, branched DNA, SISBA, TMA and the 
10 like). Alternatively, an assay for identifying a cell that expresses a HOMPS gene 
comprises detecting the presence of HOMPS protein in the cell or secreted by the 
cell. Various methods for the detection of proteins are well known in the art and 
may be employed for the detection of HOMPS proteins and HOMPS expressing 
cells. 

15 HOMPS expression analysis may also be useful as a tool for identifying and 

evaluating agents that modulate HOMPS gene expression. Identification of a 
molecule or biological agent that could inhibit HOMPS expression or over- 
expression in cancer cells may be of therapeutic value. Such an agent may be 
identified by using a screen that quantifies HOMPS expression by RT-PCR, 

20 nucleic acid hybridization or antibody binding. 

MONITORING THE STATUS OF HOMPS 

Assays that evaluate the status of the HOMPS gene and HOMPS gene 
products in an individual may provide information on the growth or oncogenic 
potential of a biological sample from this individual For example, because 

25 HOMPS are modulated by Her-2, an oncogene associated with a number of 
cancers, assays that evaluate the relative levels of HOMPS rnRNA transcripts or 
proteins in a biological sample may be used to evaluate diseases associated with 
HOMPS disregulation such as cancer and may provide prognostic information 
useful in defining appropriate therapeutic options. 

30 Because HOMPS expression is modulated, for example, in cells which 

cverexpress the Her-2 oncogene, the expression status of HOMPS can provide 
information useful for determining information including the presence, stage and 
location of displasic, precancerous and cancerous cells, predicting susceptibility to 
various stages of disease, and/or for gauging tumor aggressiveness. Consequently, 
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an important aspect of the invention is directed to the various molecular methods 
for exarnining the status of HOMPS in biological samples such as those from 
individuals suffering from, or suspected of suffering from a pathology 
characterized by disregulated cellular growth such as cancer. 
5 Oncogenesis is known to be a multistep process where cellular growth 

becomes progressively disregulated and cells progress from a normal 
physiological state to precancerous and then cancerous states (see e.g. Alers et al, 
Lab Invest. 77(5): 437-438 (1997) and Isaacs et al., Cancer Surv. 23: 19-32 
(1995)). In this context, exarriining a biological sample for evidence of 

10 disregulated cell growth can allow the early detection of such aberrant cellular 
physiology before a pathology such as cancer has progressed to a stage at which 
therapeutic options are more limited. In such examinations, the status of 
HOMPS in a biological sample of interest (such as one suspected of having 
disregulated cell growth) can be compared, for example, to the status of HOMPS 

15 in a corresponding normal sample (e.g. a sample from that individual (or 
alternatively another individual) that is not effected by a pathology, for example 
one not suspected of having disregulated cell growth) with alterations in the 
status of HOMPS in the biological sample of interest (as compared to the normal 
sample) providing evidence of disregulated cellular growth. In addition to using a 

20 biological sample that is not effected by a pathology as a normal sample, one can 
also use a predetermined normative value such as a predetermined normal level 
of mRNA expression (see e.g. Grever et al, J. Comp. NeuroL 1996 Dec 
9;376(2):306-14 and U.S. patent No. 5,837,501) to compare HOMPS in normal 
versus suspect samples. 

25 The term "status" in this context is used according to its art accepted 

meaning and refers to the condition or state of a gene and its products. As 
specifically described herein, the status of HOMPS can be evaluated by a number 
of parameters known in the art. Typically an alteration in the status of HOMPS 
comprises a change in the location of HOMPS expressing cells (as occurs in 

30 metastases) and/or an increase in HOMPS mRNA and/or protein expression. 

Typically, skilled artisans use a number of parameters to evaluate the 
condition or state of a gene and its products. These include, but are not limited to 
the location of expressed gene products (including the location of HOMPS 
expressing cells) as well as the, level, and biological activity of expressed gene 
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products (such as HOMPS mRNA polynucleotides and polypeptides). Alterations 
in the status of HOMPS can be evaluated by a wide variety of methodologies well 
known in the art, typically those discussed below. Typically an alteration in the 
status of HOMPS comprises a change in the location of HOMPS and/or 
5 HOMPS expressing cells and/or an increase in HOMPS mRNA and/or protein 
expression. 

As discussed in detail herein, in order to identify a condition or 
phenomenon associated with deregulated cell growth, the status of HOMPS in a 
biological sample may be evaluated by a number of methods utilized by skilled 

10 artisans including, but not limited to genomic Southern analysis (to examine, for 
example perturbations in the HOMPS gene), northerns and/or PCR analysis of 
HOMPS mRNA (to examine, for example alterations in the polynucleotide 
sequences or expression levels of HOMPS mRNAs), and western and/or 
immunohistochemical analysis (to examine, for example alterations in 

15 polypeptide sequences, alterations in polypeptide localization within a sample, 
alterations in expression levels of HOMPS proteins and/or associations of 
HOMPS proteins with polypeptide binding partners). Detectable HOMPS 
polynucleotides include, for example, a HOMPS gene or fragments thereof, 
HOMPS mRNA, alternative splice variants HOMPS mRNAs, and recombinant 

20 DNA or RNA molecules containing a HOMPS polynucleotide. 

The expression profile of HOMPS makes them potential markers for 
deregulated cell growth. In particular, the status of HOMPS may provide 
information useful for predicting susceptibility to particular disease stages, 
progression, and/or tumor aggressiveness. The invention provides methods and 

25 assays for determining HOMPS status and diagnosing cancers that express 
HOMPS. HOMPS status in patient samples may be analyzed by a number of 
means well known in the art, including without limitation, immunohistochemical 
analysis, in situ hybridization, RT-PCR analysis on laser capture micro-dissected 
samples, western blot analysis of clinical samples and cell lines, and tissue array 

30 analysis. Typical protocols for evaluating the status of the HOMPS gene and gene 
products can be found, for example in Ausubul et al. eds., 1995, Current Protocols 
In Molecular Biology, Units 2 [Northern Blotting], 4 [Southern Blotting], 15 
Pmmunoblotting] and 18 [PCR Analysis]. 
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As described above, the status of HOMPS in a biological sample can be 
examined by a number of well known procedures in the art For example, the 
status of HOMPS in a biological sample taken from a specific location in the 
body can be examined by evaluating the sample for the presence or absence of 
5 HOMPS expressing cells (e.g. those that. express HOMPS mRNAs or proteins). 
This examination can provide evidence of disregulated cellular growth for 
example, when HOMPS expressing cells are found in a biological sample that 
does not normally contain such cells (such as a lymph node). Such alterations in 
the status of HOMPS in a biological sample are often associated with disregulated 

10 cellular growth. Specifically, one indicator of disregulated cellular growth is the 
metastases of cancer cells from an organ of origin (such as the breast) to a 
different area of the body (such as a lymph node). In this context, evidence of 
disregulated cellular growth is important for example because occult lymph node 
metastases can be detected in a substantial proportion of patients with cancers, 

15 and such metastases are associated with known predictors of disease progression 
(see e.g. Freeman et al., J Urol 1995 Aug;154(2 Pt l):474-8). 

HOMPS ANTIBODIES 

The term "antibody" is used in the broadest sense and specifically covers 
single anti-HOMPS monoclonal antibodies (including agonist, antagonist and 

20 neutralizing antibodies) and anti-HOMPS antibody compositions with polyepitopic 
specificity. The term "monoclonal antibody" (mAb) as used herein refers to an 
. antibody obtained from a population of substantially homogeneous antibodies, i.e. 
the antibodies comprising the individual population are identical except for possible 
natuially-occumng mutations that may be present in minor amounts. 

25 Another aspect of the invention provides antibodies that bind to HOMPS 

proteins and polypeptides. The most preferred antibodies will specifically bind to a 
HOMPS protein and will not bind (or will bind weakly) to non-HOMPS proteins 
and polypeptides. Anti-HOMPS antibodies that are particularly contemplated 
include monoclonal and polyclonal antibodies as well as fragments containing the 

30 antigen binding domain and/or one or more complementarity determining regions 
of these antibodies. As used herein, an antibody fragment is defined as at least a 
portion of the variable region of the immunoglobulin molecule that binds to its 
target, i.e., the antigen binding region. 
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HOMPS antibodies of the invention may be particularly useful in imaging 
methodologies. Intracellularly expressed antibodies (e.g., single chain antibodies) 
may be therapeutically useful in treating cancers in which the expression of 
HOMPS is involved, such as for example Het-2 overexpressing cancers. Such 
5 antibodies may* be useful in the analysis, treatment, evaluation and 
characterization, and/or prognosis of other cancers, to the extent HOMPS is also 
expressed or overexpressed in other types of cancers. 

The invention also provides various immunological assays useful for the 
detection and quantification of HOMPS and mutant HOMPS proteins and 

10 polypeptides. Such assays generally comprise one or more HOMPS antibodies 
capable of recognizing and binding a HOMPS or mutant HOMPS protein, as 
appropriate, and may be performed within various immunological assay formats 
well known in the art, including but not limited to various types of 
radioimmunoassays, enzyme-linked immunosorbent assays (ELISA), enzyme-linked 

15 immunofluorescent assays (ELIFA), and the like. In addition, immunological 
imaging methods capable of detecting cancers expressing HOMPS are also 
provided by the invention, including but limited to radioscintigraphic imaging 
methods using labeled HOMPS antibodies. Such assays may be clinically useful in 
the detection, monitoring, and prognosis of HOMPS expressing cancers. 

20 HOMPS antibodies may also be used in methods "for purifying HOMPS 

and mutant HOMPS proteins and polypeptides and for isolating HOMPS 
homologues and related molecules. For example, in one embodiment, the method 
of purifying a HOMPS protein comprises incubating a HOMPS antibody, which 
has been coupled to a solid matrix, with a iysate or other solution containing 

25 HOMPS under conditions that permit the HOMPS antibody to bind to HOMPS; 
washing the solid matrix to eliminate impurities; and eluting the HOMPS from the 
coupled antibody. Other uses of the HOMPS antibodies of the invention include 
generating anti-idiotypic antibodies that mimic the HOMPS protein. 

Various methods for the preparation of antibodies are well known in the 

30 art For example, antibodies may be prepared by immunizing a suitable mammalian 
host using a HOMPS protein, peptide, . or fragment, in isolated or 
immunoconjugated form (Harlow, and Lane, eds., 1988, Antibodies: A Laboratory 
Manual, CSH Press; Harlow, 1989, Antibodies, Cold Spring Harbor Press, NY). In 
addition, fusion proteins of HOMPS may also be used, such as a HOMPS GST- 
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fusion protein. In a particular illustrative embodiment, a GST fusion protein 
comprising all or most of the open reading frame amino acid sequence of H41 as 
shown in FIG. 6 may be produced and used as an immunogen to generate 
appropriate antibodies. In another embodiment, a HOMPS peptide may be 
5 synthesized and used as an immunogen. 

In addition, naked DNA immunization techniques known in the art may be 
used (with or without purified HOMPS protein or HOMPS expressing cells) to 
generate an immune response to the encoded immunogen (for review, see Donnelly 
et aL, 1997, Ann. Rev. Immunol. 15:617-648). 

10 In an illustrative embodiment, the amino acid sequence of the H41 

HOMPS protein as shown in FIG. 6 may be used to select specific regions of the 
HOMPS protein for generating antibodies. For example, hydrophobicity and 
hydrophilicity analyses of the HOMPS amino acid sequence may be used to identify 
hydrophilic regions in the HOMPS structure. Regions of the HOMPS protein that 

15 show immunogenic structure, as well as other regions and domains, can readily be 
identified using various other methods known in the art, such as Chou-Fasman, 
Garnier-Robson, Kyte-Doolitde, Eisenberg, Karplus-Schultz or Jameson-Wolf 
analysis. 

Methods for preparing a protein or polypeptide for use as an immunogen 
20 and for preparing immunogenic conjugates of a protein with a carrier such as BSA, 
KLH, or other carrier proteins are well known in the art In some circumstances, 
direct conjugation using, for example, carbodiimide reagents may be used; in other 
instances linking reagents such as those supplied by Pierce Chemical Co., Rockford, 
IL, may be effective. Administration of a HOMPS immunogen is conducted 
25 generally by injection over a suitable time period and with use of a suitable adjuvant, 
as is generally understood in the art During the immunization schedule, titers of 
antibodies can be taken to determine adequacy of antibody formation. 

HOMPS monoclonal antibodies are preferred and may be produced by 
various means well known in the art For example, immortalized cell lines that 
30 secrete a desired monoclonal antibody may be prepared using the standard 
hybridoma technology of Kohler and Milstein or modifications that immortalize 
producing B cells, as is generally known. The immortalized cell lines secreting the 
desired antibodies are screened by immunoassay in which the antigen is the 
HOMPS protein or a HOMPS fragment When the appropriate immortalized cell 
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culture secreting the desired antibody is identified, the cells may be expanded and 
antibodies produced either from in vitro cultures or from ascites fluid. 

The antibodies or fragments may also be produced, using current 
technology, by recombinant means. Regions that bind specifically to the desired 
5 regions of the HOMPS protein can also be produced in the context of chimeric or 
CDR grafted antibodies of multiple species origin. Humanized or human HOMPS 
antibodies may also be produced and are preferred for use in therapeutic contexts. 
Methods for humanizing murine and other non-human antibodies by substituting 
one or more of the non-human antibody CDRs for corresponding human antibody 

10 sequences are well known (see for example, Jones et aL, 1986, Nature 321:522-525; 
Riechmann et aL, 1988, Nature 332:323-327; Verhoeyen et aL, 1988, Science 
239:1534-1536). See also, Carter et aL, 1993, Proc. Nad Acad. Sci. USA 89:4285 
and Sims et aL, 1993, J. Immunol. 151:2296. Methods for producing fully human 
monoclonal antibodies include phage display and transgenic methods (for review, 

15 see Vaughan et aL, 1998, Nature Biotechnology 16:535-539). 

Fully human HOMPS monoclonal antibodies may be generated using 
cloning technologies employing large human Ig gene combinatorial libraries (Le., 
phage display) (Griffiths and Hoogenboom, Building an in vitro immune system: 
human antibodies from phage display libraries. In: Clark, M., ecL, 1993, Protein 

20 Engineering of Antibody Molecules for Prophylactic and Therapeutic Applications 
in Man, Nottingham Academic, pp 45-64; Burton and Barbas, Human Antibodies 
from combinatorial libraries. Id., pp 65-82). Fully human HOMPS monoclonal 
antibodies may also be produced using transgenic mice engineered to contain 
human immunoglobulin gene loci as described in PCT Patent Application 

25 W098/24893, Kucherlapati and Jakobovits et aL, published December 3, 1997 (see 
also, Jakobovits, 1998, Exp. Opin. Invest Drugs 7(4):607-614). This method avoids 
the in vitro manipulation required with phage display technology and efficiendy 
produces high affinity authentic human antibodies. 3 

Reactivity of HOMPS antibodies with a HOMPS protein may be 

30 established by a number of well known means, including western blot, 
uumunoprecipitation, ELISA, and FACS analyses using, as appropriate, HOMPS 
proteins, peptides, HOMPS-expressing cells or extracts thereof. 

A HOMPS antibody or fragment thereof of the invention may be labeled 
with a detectable marker or conjugated to a second molecule. Suitable detectable 
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markers include, but are not limited to, a radioisotope, a fluorescent compound, a 
bioluminescent compound, chemiluminescent compound, a metal chelator or an 
enzyme. A second molecule for conjugation to the HOMPS antibody can be 
selected in accordance with the intended use. For example, for therapeutic use, 
5 the second molecule can be a toxin or therapeutic agent. Further, bi-specific 
antibodies specific for two or more HOMPS epitopes may be generated using 
methods generally known in the art. Homodimeric antibodies may also be 
generated by cross-linking techniques known in the art (e.g., Wolff et al, 1993, 
Cancer Res. 53: 2560-2565). 

10 HOMPS TRANSGENIC ANIMALS ' 

Nucleic acids that encode HOMPS or its modified forms can also be 
used to generate either transgenic animals or "knock out" animals which, in turn, 
are useful in the development and screening of therapeutically useful reagents. A 
transgenic animal (e.g., a mouse or rat) is an animal having cells that contain a 

15 transgene, which transgene was introduced into the animal or an ancestor of the 
animal at a prenatal, e.g., an embryonic stage. A transgene is a DNA that is 
integrated into the genome of a cell from which a transgenic animal develops. In 
one embodiment, cDNA encoding HOMPS can be used to clone genomic DNA 
encoding HOMPS in . accordance with established techniques and the genomic 

20 sequences used to generate transgenic animals that contain cells that express 
DNA encoding HOMPS. Methods for generating transgenic animals, particularly 
animals such as mice or rats, have become conventional in the art and are 
described, for example, in U.S. Patent Nos. 4,736,866 and 4,870,009. Typically, 
particular cells would be targeted for HOMPS transgene incorporation with 

25 tissue-specific enhancers. Transgenic animals that include a copy of a transgene 
encoding HOMPS introduced into the germ line of the animal at an embryonic 
stage can be used to examine the effect of increased expression of DNA 
encoding HOMPS. Such animals can be used as tester animals for reagents 
thought to confer protection from, for example, pathological conditions 

30 associated with its overexpression. In accordance with this facet of the 
invention, an animal is treated with the reagent and a reduced incidence of the 
pathological condition, compared to untreated animals bearing the transgene, 
would indicate a potential therapeutic intervention for the pathological condition. 
Alternatively, non-human homologues of HOMPS can be used to 
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construct a HOMPS "knock out" animal that has a defective or altered gene 
encoding HOMPS as a result of homologous recombination between the 
endogenous gene encoding HOMPS and altered genomic DNA encoding 
HOMPS introduced into an embryonic cell of the animal For example, cDNA 
5 encoding HOMPS can be used to clone genomic DNA encoding HOMPS in 
accordance with established techniques. A portion of the genomic DNA 
encoding HOMPS can be deleted or replaced with another gene, such as a gene 
encoding a selectable marker that can be used to monitor integration. Typically, 
several kilobases of unaltered flanking DNA (both at the 5 1 and 3* ends) are 

10 included in the vector (see e.g., Thomas and Capecchi, 1987, Cell 51:503) for a 
description of homologous recombination vectors]. The vector is introduced 
into an embryonic stem cell line (e.g., by electroporation) and cells in which the 
introduced DNA has homologously recombined with the endogenous DNA are 
selected (see e.g., Li et al., 1992, Cell 69:915). The selected cells are then injected 

15 into a blastocyst of an animal (e.g., a mouse or rat) to form aggregation chimeras 
(see e.g., Bradley, in Robertson, ed., 1987, Teratocarcinomas and Embryonic 
Stem Cells: A Practical Approach, (IRL, Oxford), pp. 113-152). A chimeric 
embryo can then be implanted into a suitable pseudopregnant female foster 
animal and the embryo brought to term to create a "knock out" animal. Progeny 

20 harboring the homologously recombined DNA in their germ cells can be 
identified by standard techniques and used to breed animals in which all cells of 
die animal contain the homologously recombined DNA. Knockout animals can 
be characterized for instance, for their ability to defend against certain 
pathological conditions and for their development of pathological conditions due 

25 to absence of the HOMPS polypeptide. 

IDENTIFYING MOLECULES THAT INTERACT WITH HOMPS 

The HOMPS protein sequences disclosed herein allow the skilled artisan 
to identify molecules that interact with them via any one of a variety of art 
accepted protocols. For example one can utilize one of the variety of so-called 
30 interaction trap systems (also referred to as the "two-hybrid assay"). In such 
systems, molecules that interact reconstitute a transcription factor and direct 
expression of a reporter gene, the expression of which is then assayed. Typical 
systems identify protein-protein interactions in vivo through reconstitution of a 
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eukaryotic transcriptional activator and are disclosed for example in U.S. Patent 
Nos. 5,955,280, 5,925,523, 5,846,722 and 6,004,746. 

Alternatively one can identify molecules that interact, with HOMPS 
protein sequences by screening peptide libraries. In such methods, peptides that 
5 bind to selected receptor molecules such as HOMPS are identified by screening 
libraries that encode a random or controlled collection of amino acids. Peptides 
encoded by the libraries are expressed as fusion proteins of bacteriophage coat 
proteins, and bacteriophage particles are then screened against the receptors of 
interest. Peptides having a wide variety of uses, such as therapeutic or diagnostic 

10 reagents, may thus be identified without any prior information on the structure of 
the expected ligand or receptor molecule. Typical peptide libraries and screening 
methods that can be used to identify molecules that interact with HOMPS protein 
sequences are disclosed for example in U.S. Patent Nos. 5,723,286 and 5,733,731. 
Alternatively, cell lines expressing HOMPS can be used to identify 

15 protein-protein interactions mediated by HOMPS. This possibility can be 
examined using imrnunoprecipitation techniques as shown by others (Hamilton, 
BJ., et aL, 1999, Biochem. Biophys. Res. Commun. 261:646-51). Typically 
HOMPS protein can be immunoprecipitated from HOMPS expressing cancer 
cell lines using anti-HOMPS antibodies. Alternatively, antibodies against His-tag 

20 can be used in cell line engineered to express HOMPS (vectors mentioned 
above). The immunoprecipitated complex can be examined for protein 
association by procedures such as western blotting, 35 S-methionine labeling of 
proteins, protein rnicrosequencing, silver staining and two dimensional gel 
electrophoresis. 

25 Related embodiments of such screening assays include methods for 

identifying small molecules that interact with HOMPS. Typical methods are 
discussed for example in U.S. Patent No. 5,928,868 and include methods for 
forming hybrid ligands in which at least one ligand is a small molecule. In an 
illustrative embodiments, the hybrid ligand is introduced into cells that in turn 

30 contain a first and a second expression vector. Each expression vector includes 
DNA for expressing a hybrid protein that encodes a target protein linked to a 
coding sequence for a transcriptional module. The cells further contains a 
reporter gene, the expression of which is conditioned on the proximity of the 
first and second hybrid proteins to each other, an event that occurs only if the 
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hybrid ligand binds to target sites on both hybrid proteins. Those cells that 
express the reporter gene are selected and the unknown small molecule or the 
unknown hybrid protein is identified. 

Methods which may also be used to quantitate the expression of HOMPS 
5 include radiolabeling or bio tiny lating nucleotides, coamplification of a control 
nucleic acid, and standard curves onto which the experimental results are 
interpolated (Melby, P. C et al. (1993) J. Immunol. Methods, 159:235-244; 
Duplaa, C. et al. (1993) Anal. Biochem. 229-236). The speed of quantitation of 
multiple samples may be accelerated by running the assay in an ELISA format 
10 where the oligomer of interest is presented in various dilutions and a 
spectrophotometric or colorimetric response gives rapid quantitation. 

In another embodiment of the invention, the nucleic acid sequences 
which encode HOMPS may also be used to generate hybridization probes which 
are useful for mapping the naturally occurring genomic sequence. The sequences 
15 may be mapped to a particular chromosome or to a specific region of the 
chromosome using well known techniques. Such techniques include FISH, 
FACS, or artificial chromosome constructions, such as yeast artificial 
chromosomes, bacterial artificial chromosomes, bacterial PI constructions or 
single chromosome cDNA libraries as reviewed in Price, C. M. (1993) Blood Rev. 
20 7:127-134, and Trask, B. J. (1991) Trends Genet. 7:149-154. FISH (as described 
in Verma et al. (1988) Human Chromosomes: A Manual of Basic Techniques, 
Pergamon Press, New York, N.Y.) may be correlated with other physical 
chromosome mapping techniques and genetic map data. Examples of genetic 
map data can be found in the 1994 Genome Issue of Science (265:1981 f). 
25 Correlation between the location of the gene encoding HOMPS on a 

physical chromosomal map and a specific disease, or predisposition to a specific 
disease, may help delimit the region of DNA associated with that genetic disease. 
The nucleotide sequences of the subject invention may be used to detect 
differences in gene sequences between normal, carrier, or affected individuals. 
30 In situ hybridization of chromosomal preparations and physical mapping 

techniques such as linkage analysis using established chromosomal markers may 
be used for extending genetic maps. Often the placement of a gene on the 
chromosome of another mammalian species, such as mouse, may reveal 
associated m a r kers even if the number or arm of a particular human 
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chromosome is not known. New sequences can be assigned to chromosomal 
arms, or parts thereof, by physical mapping. This provides valuable information 
to investigators searching for disease genes using positional cloning or other gene 
discovery techniques. Once the disease or syndrome has been crudely localized 
5 by genetic linkage to a particular genomic region, for example, AT to llq22-23 
(Gatti, R. A. et al. 

(1988) Nature 336:577-580), any sequences mapping to that area may represent 
associated or regulatory genes for further investigation. The nucleotide sequence 
of the subject invention may also be used to detect differences in the 

10 chromosomal location due to translocation, inversion, etc. among normal, carrier, 
or affected individuals. 

In another embodiment of the invention, HOMPS, its catalytic or 
immunogenic fragments or oligopeptides thereof, can be used for screening 
libraries of compounds in any of a variety of drug screening techniques. The 

15 fragment employed in such screening may be free in solution, affixed to a solid 
support, borne on a cell surface, or located intracellularly. The formation of 
binding complexes, between HOMPS and the agent being tested, may be 
measured. 

Another technique for drug screening which may be used provides for 
20 high throughput screening of compounds having suitable binding affinity to the 
protein of interest as described in published PCT application WO84/03564. In 
this method, as applied to HOMPS large numbers of different small test 
compounds are synthesized on a solid substrate, such as plastic pins or some 
other surface. The test compounds are reacted with HOMPS, or fragments 
25 thereof, and washed. Bound HOMPS is then detected by methods well known in 
the art Purified HOMPS can also be coated directly onto plates for use in the 
aforementioned drug screening techniques. Alternatively, non-neutralizing 
antibodies can be used to capture the peptide and immobilize it on a solid 
support 

30 In another embodiment, one may use competitive drug screening assays 

in which neutralizing antibodies capable of binding HOMPS specifically compete 
with a test compound for binding HOMPS. In this manner, the antibodies can be 
used to detect the presence of any peptide which shares one or more antigenic 
determinants with HOMPS. 
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In additional embodiments, the nucleotide sequences which encode 
HOMPS may be used in any molecular biology techniques that have yet to be 
developed, provided the new techniques rely on properties of nucleotide 
sequences that are currendy known, including, but not limited to, such properties 
5 as the triplet genetic code and specific base pair interactions. 

KITS 

For use in the applications described or suggested above, kits are also 
provided by the invention. Such kits may comprise a carrier means being 
compartmentalized to receive in close confinement one or more container means 
10 such as vials, rubes, and the like, each of the container means comprising one of 
the separate elements to be used in the method. For example, one of the 
container means may comprise a probe that is or can be detectably labeled. Such 
probe may be an antibody or polynucleotide specific for a HOMPS protein or a 
HOMPS gene or message, respectively. Where the kit utilizes nucleic acid 
1 5 hybridization to detect the target nucleic acid, the kit may also have containers 
containing nucleotide(s) for amplification of the target nucleic acid sequence 
and/or a container comprising a reporter-means, such as a biotin-binding 
protein, such as avidin or streptavidin, bound to a reporter molecule, such as an 
enzymatic, florescent, or radioisotope label. 
20 The kit of the invention will typically comprise the container described 

above and one or more other containers comprising materials desirable from 
a commercial and user standpoint, including buffers, diluents, filters, needles, 
syringes, and package inserts with instructions for use. A label may be present on 
the container to indicate that the composition is used for a specific therapy or non- 
25 therapeutic application, and may also indicate directions for either in vivo or in vitro 
use, such as those described above. 

EXAMPLES 

Various aspects of the invention are further described and illustrated by 
way of the several examples that follow, none of which are intended to limit the 
30 scope of the invention. 

L Cell Culture 
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Cells were grown in RPMI medium 1640, supplemented with 10 % fetal 
bovine serum, 2mM glutamine, and 1% penicillin G-s trep to mycin- fungizone 
solution. Cells were harvested at 80 % confluency for total RNA extraction. 

5 2. RNA Preparation 

Total cellular RNA was purified by guanidinium / cesium chloride 
ultracentrifugation (16). Messenger RNA was isolated by two passages through 
an oligo dT cellulose column (T3 - Collaborative Research) (17). The quality and 
mRNA composition of the resulting RNA population were confirmed by 
10 Northern blot analysis by probing with P-actin and HER-2 cDNAs. Both MCF- 
7/control and MCF-7/HER2 mRNA pools contain equivalent, basal expression 
of the endogenous HER-2 transcript whereas the transcript representing 
transfected HER-2 cDNA was present only in the MCF-7/HER2 cells. 

15 3. Construction of the cDNA Library 

Five ug of MCF-7/HER2 poly (A) + RNA was constructed into ZAP 
Express™ vector (Stratagene) according to the manufacturers protocol using 
materials provided in the cDNA synthesis kit The recombinant phage were 
packaged in Gigapack II Gold packaging extract (Stratagene). The packaged 
20 library was amplified one round through passage on XLl-Blue MRF' host cells 

(6x1 0 5 p£u/|il titer). As determined by the X-gal/IPTG color assay, the 
background (non -recombinant) phage level was less than 0.1%. 

4. Differential hybridization 

25 The MCF-7/HER2 cDNA library was plated on XLl-Blue MRF' host 

cells at a density of 2,000 pfu per each of eight 150 mm petri dishes. After 
plating, actin and HER-2 clones purified from the same library were each loaded 
onto four designated spots within the individual plates to be used as 
hybridization controls. The nitrocellulose filters (Millipore) were placed on the 

30 agar plates 1.5 min. for the first filter, 3 min. for the second, and 7 min. for the 
third. The phage DNA was denatured for 3 min. in a solution containing 0.5 M 
NaOH 1.5 M NaCl, neutralized for 3 min. in a solution containing 3 M NaCi 0.5 

M Tris pH 7.5, and rinsed in 2xSSC The treated filters were air-dried and baked 
at 80°C for 1 hour. 
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The radiolabeled cDNA probes (MCF7/control and MCF-7/HER2) 
were prepared as follows. Poly (A) + RNA was randomly labeled, by using both 
random hexamers and oligo dT primers, in 20 |il solution containing 1.0 jig poly 
(A) + RNA, Ix MMLV buffer, 1 mM each of dATP, dGTP, dTTP and 0.045 mM 
5 dCTP, 100 u^Ci y( 32 P]dCTP, 0.5 ug oligo dT( 15 ), 0.2 jig random primer, 20 U 

RNase inhibitor, and 200 U Moloney murine leukemia virus (MMLV) reverse 
transcriptase. The reaction was incubated at room temperature first for 10 min. 
to allow primer annealing and further incubated at 37°C for 1 hour. Upon 
addition of 4.6 |-ll of 0.5 M NaOH, the reaction mix was incubated at 70°C for 20 
10 min. Incorporated counts were eluted from a spin column (Chromaspin 100 / 
Clontech) in 1XTEN buffer (0.1 M NaCl, 10 mM Tris pH 8.0, 1 mM EDTA). 

Approximately 5.9x1 0^ dpm counts of MCF-7 /control probes were 
added to a first hybridization containing 26 ml hybridization solution and the first 
set of 8 filters obtained from each plate. Equal counts of MCF-7/HER2 probes 
15 were added to the second set of filters. The third set of filters were hybridized 
with radioactive HER-2 cDNA in order to avoid selecting the HER-2 containing 
clones. Hybridization solution contained 50% formamide, 25X Denhardts, 

sonicated salmon sperm DNA, NaPO^pH 6.8, sodium pyrophosphate, and ribo 

ATP. Prehybridization was performed at 42°C for 4 h. and hybridization at 42°C 

20 for 4 days (overnight hybridization for the third set of filters). Filters were 
washed at room temperature for 5 min. (x3) in 0.2xSSC/0.1%SDS, and at.60°C 
for 15 min. (x7) in the same solution. The washed filters were exposed with an 
intensifying screen to Kodak-XAR5 films at -70°C for various time periods. 
Autoradiograms were analyzed to compare differences in focal signal intensity 

25 between the replica filters. 

For the secondary screenings, the primary screening procedure was 
repeated except that each clone was separately plated onto a 100 mm petri dish at 
a low density of 25-50 plaques/plate . . 

30 5. Probe generation for Northern hybridization 

The pBK-CMV phagemid was in vivo excised from the lamdaphage 
vector according to the manufacturer's instructions (Stratagene). The cDNA 
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inserts were isolated from the plasmid either by restriction enzyme digestions or 
by PCR amplification using T3 and T7 sequences as primers. 

6. Northern Hybridization 

.5 Either 2 ug of poly (A) + RNA or 20 ug of total RNA was loaded onto a 

1% formaldehyde agarose gel and electrophoresed at 70 V for 4 hours. The 
RNA was transferred to a nylon membrane in lOx SSC. The purified cDNA 
inserts were random-labeled in a 50 jil reaction mix which contained 50 ng 
template, [y-^P] dCTP, 20 jig BSA, 6 U Klenow. Incorporated counts were 

10 eluted from a G-50 Sephadex spin column (Pharmacia). Approximately 3x1 0^ 
dpm counts per 1 ml hybridization solution were used. The hybridization was 
carried out in 50% formamide, 2xSSC, 0.1% SDS, 10 mg/ml salmon sperm 
DNA, and 10% dextran sulfate, at 42°C for 16 hours. Membranes were washed 
in 2xSSC/0.1%SDS at 25°C for 10 min. (x3), and in the same solution at 65°C 

15 for 5 min. (x2). The washed membranes were exposed with an intensifying 
screen to Kodak-XAR film at -70°C. 

7. DNA Sequencing and Computer Analysis 

Minipreparations of pBK-CMV plasmid vector (Qiawell 8 Ultra, Qiagen) 
20 were sequenced with T3/T7 promoter primers and internal primers using an 
automatic DNA sequencer (Applied Biosystems Model 373A). The sequence 
similarity search was performed using GenBank and EMBL DNA databases. 

8. In Vitro Transcription / Translation 

25 The cDNA inserts were translated into polypeptides in a TNT coupled 

reticulocyte system (Promega) according to the manufacturer's protocol ; 
Approximately lug of purified plasmid template was transcribed and translated in 
the 50 ul reaction containing T3 RNA polymerase, rabbit reticulocyte lysate, 
[ 35 S]methionine, etc. 5 ul of the end product was aliquoted to estimate the 

30 molecular size of the in vitro translated protein using 10% SDS-PAGE and 
prestained protein size markers (Bio-Rad). 

9. Extraction of total RNA from breast tumor samples 
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Breast tumors were obtained from patients at the time of surgery as part 
of a core tissue procurement resource sponsored by the DOD breast cancer 
program. All tumor samples were snap frozen in liquid nitrogen and kept at - 
70°C before extraction of RNA. Frozen tissues were pulverized in liquid 
5 nitrogen prior to homogenization in cold 4 M guanidine thiocyanate buffer (7.5 
ml/g of tissue). The homogenates were centrimged for 10 min. at 4°C at 8000 g 
in order to remove cell debris. RNA was sedimented through a cesium chloride 
gradient (5.7 M / 2.4 M CsCl2) via ultracentrifugation (18 h at 36,000 rpm, 20°C). 
The separated RNA phases were extracted with phenol-chloroform prior to wash 
10 with 100 % ethanol. The RNA pellet was precipitated by adding 2 ml of 0.4 M 
sodium acetate and 2.5 vol. of 100 % ethanol, and storing over night at -20°C. 
After centrifugation (20 min at 10,000 g), the pellets were dried and dissolved in 
DEPC water. 

15 10. Isolation of Differentially Expressed Genes Associated with HER- 
2/neu Overexpression 

In our first round of differential screening, 1 6,000 clones from the MCF- 
7/HER-2 library were analyzed. Clones showing a stronger signal intensity 
hybridized with the MCF-7 /control cell cDNA probes were labeled "C" clones 

20 (CI, C2, C3, etc.), whereas those demonstrating a stronger signal intensity 
hybridized with HER-2 overexpressing cell cDNA probes were labeled "H" 
clones (HI, H2, H3, etc.). From this primary screening, a total of 127 
differentially expressed clones were isolated including 77 C clones and 50 H 
clones representing genes whose expression levels are decreased (C clones) or 

25 increased (H clones) respectively in association with HER-2 overexpression. 
Each clone was ranked according to degree of differential hybridization based on 
signal intensity ranging from more than a five fold to less than a two fold change 
based on visualization. 

Forty-three C clones and 36 H clones which demonstrated the greatest 

30 differences in signal intensity in the primary screening were taken through 
secondary screening to ensure consistent differential expression and to isolate 
pure colonies. Subsequent to this isolation, the clones were cross-hybridized to 
determine redundancy. This resulted in a total of 7 non-redundant C clones and 
12 non-redundant H clones. Finally, to confirm our screening technique, 
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differential expression patterns of the selected clones were evaluated by Northern 
blot analysis of RNA from MCF-7/control and HER-2 cells. A total of 5 C 
clones and 1 1 H clones showed expression patterns consistent with expectations 
from the differential hybridization approach while 2 C clones and 1 H clone 
5 failed to demonstrate the anticipated pattern. 

11. DNA Sequencing and Identification of the Clones 

Individual clones whose differential expression was confirmed by 
Northern blot analysis were subsequendy analyzed by DNA sequencing,, and 

10 these data demonstrate that full-length cDNAs were obtained for most of the 
clones. The differentially expressed genes were grouped into three different 
classes based on computer searches against the GenBank and EMBL data bases; 
(1) known genes with previously characterized function (2) previously identified 
genes with relatively uncharacterized function (3) novel sequences (summarized 

15 in Table 1 below). In addition, each clone was grouped into three different 
categories based on significance of their relative difference in expression (Table 
1). Even genes whose differential expression is small (-2 fold) were included if 
this difference was corisistendy reproducible in multiple analyses. 

As stated above, the MCF-7/HER-2 cells behave significandy differendy 

20 than their isogenic control parental counterparts; with increases in DNA 
synthesis, cell growth in vitro, soft agar cloning efficiency and tumorigenicity. Nine 
of the differentially expressed genes identified in this study are known to be 
associated with the malignant phenotype. For example, cytokeratin 8 (C29) 
(Taniguchi, T., Fujii-Kuriyama, Y. and Muramatsu, M. (1980) Proc Natl Acad Sci U 

25 S A, 77(7), 4003-6), cytokeratin 18 (C49) (Oshima, R. G., Millan, J. L. and 
Cecena, G. (1986) Differentiation, 33(1), 61-8), and gamma actin (C72) (Erba, H. 
P., Gunning, P. and Kedes, L. (1986) Nucleic Acids Res, 14(13), 5275-94) are 
cytoskeletal proteins essential for maintaining both cell shape and motility of 
normal cells and their expression differs from levels seen in the aberrant 

30 cytoskeleton of cancer cells (Mukhopadhyay, T. and Roth, J. A. (1996) Anticancer 
Res, 16(1), 105-12). Similarly GAPDH (H31) (Tokunaga, K., Nakamura, Y., 
Sakata, K., Fujimori, K., Ohkubo, M, Sawada, K. and Sakiyama, S. (1987) Cancer 
Res, 47(21), 5616-9) and succinyl CoA transferase (H45) (Kassovska-Bratinova, 
S., Fukao, T., Song, X. Q., Duncan, A. M, Chen, H. S., Robert, M. F., Perez- 
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Cerda, C, Ugarte, M., Chartrand, C, Vobecky, S., Kondo, N. and Mitchell, G. A. 
(1996) Am J Hum Genet, 59(3), 519-28) are involved in the metabolic pathway of 
more rapidly growing cells including cancer cells. The increased expression of 
ribosomal proteins, L8 (Hi 6) (Hanes, J., Klaudiny, J., von der Kammer, H. and 
5 Scheit, K. H, (1993) Biochem Biopbys Res Commun, 197(3), 1223-8) and LLrep3 
(H35) (Heller, D. L., Gianola, K. M. and Leinwand, L. A. (1988) Mot Cell Biol, 
8(7), 2797-803), is consistent with the increased rate of protein translation 
required for cancer cell growth. Two other genes identified in our study whose 
cellular functions have been previously characterized are Cathepsin D (C31) 
10 (Faust, P. L., Kornfeld, S. and Chirgwin, J. M. (1985) Prvc Natl Acad Sci USA, 
82(15), 4910-4), an acidic lysosomal protease, and the 90 kDa heat shock protein 
(HI 8) (Rebbe, N. F., Ware, J., Bertina, R. M., Modrich, P. and Stafford, D. W. 
(1987) Gene, 53(2-3), 235-45) which is a chaperon protein associated with steroid 
hormone receptor genes. Both of these genes are known to be differentially 
15 expressed in malignant cells (Johnson, M. D., Torri, J. A., Lippman, M E. and 
Dickson, R. B. (1993) Cancer Res, 53(4), 873-7; Mileo, A. M., Panuele, M., 
Battaglia, F., Scambia, G., Benedetti-Panici, P., Mancuso, S. and Ferrini, U. (1990) 
Anticancer Res, 10(4), 903-6). 

Three genes (HI 3, HI 4, H37) found to be overexpressed in the HER-2 
20 overexpressing cells matched cDNA sequences which were previously identified 
by other investigators but not fully characterized. The HI 3 clone appears to be 
an alternate splice variant of DNA fragmentation factor (DFF) (GenBank 
accession no. U91985) (Liu, X., Zou, H., Slaughter, C. and Wang, X. (1997) Cell, 
89(2), 175-84); the first 261 amino acid sequences contained in both HI 3 and 
25 the DFF open reading frames are identical, but HI 3 lacks 70 amino acids at the 
3' end and contains 7 different amino acids in their place. The predictive amino 
acid sequence for HI 3 is highly similar to the ICAD (Inhibitor of Caspase- 
Activated Dnase)- S and -L proteins (Sakahira, H., Enari, M. and Nagata, S. 
(1998) Nature, 391(6662), 96-9; Enari, M., Sakahira, H., Yokoyama, H., Okawa, 
30 K., Iwamatsu, A. and Nagata, S. (1998) Nature, 391(6662), 43-50) (73% and 69% 
sequence identity, respectively). In order to ensure that the cDNAs cloned, 
either uncharacterized or novel, can be efficiendy translated into protein products 
of expected sizes, we performed in vitro - translation experiments. As predicted, 
the HI 3 cDNA was translated into the polypeptide of approximately 30 kDa 
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(Figure 13) Clone H14 is identical to DRP-1 (Density regulated protein-1 / 
GenBank accession no. AF038554) (Deyo, J. E., Chiao, P. J. and Tainsky, M. A. 
(1998) UNA CellBiol, 17(5), 437-47) except that it lacks 285 bp at the 5' end and 
has 23 additional bp at the 3* end plus a poly (A) tail. Both the H14 and DRP-1 
cDNAs are likely to be partial, 3' end sequences of a larger transcript since no 
suitable initiating codon was found in either sequences, and the HI 4 cDNA 
hybridized with an additional transcript of approximately 6.5 kb on Northern 
blot analysis. According to the EST (Expressed Sequence Tags) database 
analysis, the 5' end of the HI 4 cDNA sequence can be extended, and the ESTs 
covering the extended portion of the gene is designated THC202438 (deposited 
in the Tentative Human Consensus effort) (Kirkness, E. F. and Kerlavage, A. R. 
(1997) Methods Mo/ Biol, 69, 261-8). The H37 cDNA sequence has been 
previously deposited into GenBank as RNA binding motif protein 5 (RBM5) 
(accession no. AF091263, unpublished) found within a region reported to be 
homozygously deleted in lung cancer and believed to contain (a) major tumor 
suppressor gene(s) involved in a majority of small cell and non-small cell lung 
cancers (Wei, M. H, Latif, F., Bader, S., Kashuba, V., Chen, J. Y., Duh, F. M., 
Sekido, Y., Lee, C C, Geil, L., Kuzmin, L, Zabarovsky, E., Klein, G., Zbar, B., 
Minna, J. D. and Lerman, M. I. (1996) Cancer Res, 56(7), 1487-92). The H37 
cDNA contains an open reading frame of 816 amino acid and is translated in vitro 
into a predicted protein product of approximately 90 kDa (Figure 13). Analysis 
of the putative H37 protein against PROSITE protein profde databases 
recognized the presence of two RNA binding domains, located at amino acid 
residues 140-147 and 274-281, which are perfect matches with the consensus 
eukaryotic sequence for a putative RNA-binding region RNP-1 (Bandziulis, R. J., 
Swanson, M. S. and Dreyfuss, G. (1989) Genes Dev, 3(4), 431-7). 

Four clones (C40, HI 7, H41, H63) represented as-yet unknown genes in 
the DNA databases, and three of these (C40, H17, H41) were found to contain 
probable open reading frames. The 1750 bp long C40 clone contains a 510 
araino acid open reading frame (Figure 14A). The coding region of this gene 
begins with a start codon at nucleotide position 74 and has an in-frame stop 
codon at position 1604 (Figure 14A). The C40 clone was in vitro - translated into 
the predicted major protein product of approximately 55 kDa (Figure 13). The 
385 bp region (nt 568-952) of this gene is 89% identical to a GenBank transcript 
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(accession no. U56429), however, the putative C40 protein does not share any 
significant homology to any known proteins in the databases. Examination of 
die C40 protein sequence using the GCG program Motifs revealed the presence 
of a leucine zipper motif at amino acid positions 104-125, which is a perfect 
5 match with the consensus sequence for the leucine zipper pattern (Steeg, P. S., 
Bevilacqua, G., Kopper, L., Thorgeirsson, U. P., Talmadge, J. E,, Liotta, L. A. 
and Sobel, M. E. (1988) ] Natl Cancer Inst, 80(3), 200-4) (Figure 14A). 

The 1981 bp long HI 7 clone contains a consensus initiation codon 
(Kozak, M. (1991) J BiolCbem, 266(30), 19867-70) at nucleotide 66 followed by a 

10 486 amino acid open reading frame and a 458 bp 3* untranslated region including 
a polyadenylation signal (AATAAA) (Figure 14B). As is known in the art, the 
nucleic acid sequence around the 5' proximal AUG codon is typically a Kozak 
consensus sequence where eukaryotic ribosomes initiate translation and the 
general rule the eukaryotic ribosomes initiate translation exclusively at the 5' 

15 proximal AUG codon is abrogated only under rare conditions (see e.g. Kozak 
PNAS 92(7): 2662-2666, (1995) and Kozak NAR 15(20): 8125-8148 (1987)). In 
vitro translation generated the predicted protein product of approximately 50 
kDa (Figure 13). The putative H17 protein has 39.3% identity with a C. elegans 
cDNA of unknown function (Z77667) over a 422 amino acid region (aa 61-482) 

20 and is similar to a metabolic enzyme sarcosine oxidase (AE001086) with 29.2% 
identity in a 171 amino acid region (aa 66-236) (Figure 14B). 

The 3346 bp H41 clone contains a 323 bp 5' untranslated region followed 
by an initiation codon with a Kozak consensus (Kozak, M. (1991) / Biol Cbem, 
266(30), 19867-70) and an extensive, 2249 bp 3' untranslated region (Figure 14C). 

25 The 258 amino acid residues encoded by its open reading frame was translated in 
vitro into the predicted protein product of approximately 30 kDa and an 
additional protein of lower molecular weight (Figure 13). The putative H41 
protein is related to one of the "fast evolving" drosophila genes of unknown 
function (AF005858) (Schmid, K. J. and Tautz, D. (1997) Proc Natl Acad Sci U S 

30 A y 94(18), 9746-50) with 28.7% identity in a 167 amino acid region (aa 9-175) 
(Figure 14C). According to a pSORT protein database search, the H41 gene 
product is predicted to be a nuclear protein based on the presence of a nuclear 
localization signal, 4 basic amino acid (lysine) residues, at its N-terminus (94.1% 
reliability by Reinhardt's method) (Reinhardt, A. and Hubbard, T. (1998) Nucleic 
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Acids Res, 26(9), 2230-6) (Figure 14C). For the above three novel cDNA 
sequences (C40, HI 7, H41), we could not find any ESTs which would extend our 
sequences further either at the 5' or 3' ends. 

Lastly, the novel H63 clone is believed to be a partial, 3' sequence of a 
longer transcript because this sequence did not contain any probable open 
reading frames, and the 2068 bp DNA hybridized with a transcript of 
approximately 4.5 kb on a Northern blot. According to the EST database 
analysis, die 5' end of the H63 cDNA sequence can be extended and assembled 
as THC1 75350. 

Table 1: Identity of differentially expressed clones 



Clone U> 


Size* 
(bp) 


done Identity mRNA sifcc 1 

(kb) 


Rchovc difference 
in expression* 




(1) Genes with previously characterized function"* 






C29 


1777 


Keratin 8 


1.8 


U 


C31 


2053 


Cathcpsin D 


2.1 


u 


04') 


1423 


Keratin 18 


1.4 


u 


C72 


1940 


Gamma actin 


1.9 


i 


Hi 6 


933 


Ribosomal protein IJ3 


0.9 


t 


1118 


2530 


90-kDa heat-shock protein 


1.2, 25 


ttt 


H3I 


1284 


Glyccraldchydc-3-phosphatc dehydrogenase 


1.3 


tt 


H35 


948 


LLRcp3 


0.9 


TTT 


H45 


2317 


Succinyl CoA:3-oxoacid CoA transferase 


1.5,3.3,5.3 


tt 


(2) Recently identified genes 


with relatively un characterized function 






HI 3 


1027 


DNA fragmentation factor (DPP) 


1.0. 1.4, 3.4,6.4 


tt 


MM 


2214 


Density Regulated Protein- 1 (DR1M) 


0.96, 22, 6.5 


ttt 


H37 


3091 


RN A binding motif protein 5 (RJ3M5) 


1.9,3.1,6.5 


tt 



15 



20 



(3) Novel sequences 
C40 1750 
H17 1981 
H41 3.346 
H63 2068 



Not previously identified 1.8, 26, 4.9 X 

Not previously identified 20, 4.4 ftf 

Not previously identified 1 ,9, 27, 3.3, 4.0 ttt 

Not previously identified 1.9, 4.5 f 

•Prefix "C M and "H" denotes genes whose expression level decrease and increase, respectively, in MCF-7/HELR-2 cells 

cennpared 

to the control cells. 

u 1 he size of cDNAs cloned in the differential screening and used as probes for Northern blot analysis. 
'The sizes of differentially expressed transcripts on Northern autoradiogram. 

"Directions of arrows indicate expression level increase (t) or decrease (i), respectively, in HER-2 overexpressing cells. 
'l*he number of arrows indicates relative difference in expression level change by visualization, i.e. t = ~2 fold tt = 
3-5 fold, ttt = >S fold. 

«'Ilie nucleotide sequences reported in thus study have been submitted to die Gen Ban t™/EMBL database and assigned 
these 

accession numbers. 
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12. Confirmation of Differential Expression in Ovarian Cancer Cell 
Counterparts 

5 To ensure that differential expression of these genes is a phenomenon 

consistently associated with HER-2//z«/ overexpression rather than a unique 
event restricted to a single cell line, we evaluated the expression of these genes in 
human ovarian cancer cells (CaOv-3) engineered to overexpress HER-2/ ncu in a 
fashion identical to the MCF-7/HER-2 cells (Pietras, R. J., Fendly, B. M., Chazin, 

10 V. R., Pegram, M. D., Howell, S. B. and Slamon, D. J. (1994) Oncogene, 9(7), 1829- 
38; Pegram, M. D., Finn, R. S., Arzoo, K., Beryt, M., Pietras, R. J. and Slamon, D. 
J. (1997) Oncogene, 15(5), 537-47). The amount of HER-2/W protein expressed, 
as determined by quantitative Western blot analysis, was approximately 1.62 
pg/celi for MCF-7/HER-2 cells and 1.14 pg/cell for the CaOv-3/HER-2 cells as 

15 compared to 0.36 pg/cell and 0.41 pg/cell for the control transfected cells 
respectively (Press, M. F., Pike, M. C, Chazin, V. R., Hung, G., Udove, J. A., 
Markowicz, M., Danyluk, J., Godolphin, W., Sliwkowski, M., Akita, R. and et aL 
(1993) Cancer Res, 53(20), 4960-70). In addition, the biologic changes induced by 
HER-2/ neu overexpression in the human ovarian cancer cells were similar to 

20 those seen and described above in the human breast cancer cells (Chazin, V.R. 
(1991). The biologic effects of HER-2/ neu proto-oncogene overexpression, 
Chapter 2. Department of Microbiology and Immunology, University of 
California, Los Angeles). Based on this consistent pattern of biologic changes 
induced by HER-2/ neu overexpression for both the MCF-7/HER-2 and CaOv- 

25 3/HER-2 cell lines, we would anticipate that at least some of the changes in 
mRNA expression patterns associated with HER-2 overexpression might be 
similar between these two cell lines if these genes are relevant to the HER-2//z«/ 
overexpressing phenotype. Northern blot analysis of ovarian cancer cell line pair 
demonstrated that 12 of 16 (75%) clones found to be differentially expressed in 

30 MCF-7/ HER-2 as compared to isogenic control breast cancer cells were also 
differentially expressed in the CaOv-3 ovarian cancer cells (Figure 15). In 
addition, for most of the clones, the degree of differential expression was 
consistent between these two distinct epithelial cell lines. This phenomenon is 
seen with clones H17, HI 8, H35, H37, and H41 which show marked increases in 

35 expression levels in association with HER-2/ neu overexpression and clones C49, 
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C72, HI 6, H45, and H63 which show more subtle differences (Figure 15). Four 
of the MCF-7 differentially expressed clones (C31, C40, HI 4, H31) did not 
demonstrate any noticeable difference in expression in CaOv-3/HER-2 vs. 
control cells. 

5 

13. Differential Expression of Two of the Identified cDNAs in Primary 
Human Breast Cancer Samples 

To further confirm differential expression of the novel (or 

10 uncharacterized) cDNAs in actual human malignancies, we examined a panel of 
primary human breast cancer specimens by Northern biot analyses. Among 
these, we were able to detect clear signals on the tumor Northern blots for two 
of the tested clones, H37 and H41. Less success with the other cDNAs (i.e. C40, 
HI 3, HI 4, HI 7, H63) can be best explained with their relatively rare message 

15 level. For the H37 cDNA, 15 individual cancer samples were analyzed, and 8 of 
these (#3, 4, 5, 7, 8, 12, 14, 15) overexpress HER-2/W (Figure 16A). Seven of 
these eight tumors (88 %) demonstrated overexpression of the H37 transcript, 
while only one of seven (14%) of the non-HER-2 overexpressors overexpresses 
this cDNA <p =0.732, p<0.005) (Figure 16A). For the H41 cDNA, 5 of 7 (71 %) 

20 of the HER-2 overexpressing malignancies overexpress the gene while two of 
eight (25 %) non-HER-2 overexpressors were found to have increased levels of 
this novel transcript (□ =0.464, p<0.075) (Figure 16B). Tumor #8 did not 
express high levels of either H37 nor H41 despite its high HER-2 expression 
level. Overall, these data suggest that the above two novel genes may be 

25 contribute in some way to* the phenotype associated with HER-2 overexpression. 
Further studies of this association are currendy underway using greater numbers 
of tumor specimens. 

14. Extension of HOMPS-Encoding Polynucleotides 

30 Nucleic acid sequences of or Figure 1, Figure 3, Figure 5, Figure 7, Figure 

8 or Figure 10 can be used to design oligonucleotide primers for extending a 
partial nucleotide sequence to full length or for obtaining 5' or 3', intron or other 
control sequences from genomic libraries. One primer is synthesized to initiate 
extension in the antisense direction (XLR) and the other is synthesized to extend 

35 sequence in the sense direction (XLF). Primers are used to facilitate the extension 
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of the known sequence "outward" generating amplicons containing new, 
unknown nucleotide sequence for the region of interest. The initial primers are 
designed from the cDNA using OLIGO 4. 06 (National Biosciences), or another 
appropriate program to be 22-30 nucleotides in length, to have a GC content of 
5 50% or more, and to anneal to the target sequence at temperatures about 68 ° -72 
C Any stretch of nucleotides which would result in hairpin structures and 
primer-primer dimerizations is avoided. 

The original, selected cDNA libraries, or a human genomic library are 
used to extend the sequence; the latter is most useful to obtain 5 1 upstream 
10 regions. If more extension is necessary or desired, additional sets of primers are 
designed to further extend the known region. 

By following the instructions for the XL-PCR kit (Perkin Elmer) and 
thoroughly mixing the enzyme and reaction mix, high fidelity amplification is 
obtained. . 

15 Beginning with 40 pmol of each primer and the recommended 

concentrations of all other components of the kit, PCR is performed using the 
Peltier Thermal Cycler (PTC200; M. J. Research, Watertown, Mass. ) and the 
following parameters: 



20 Step 1 94 °C. for 1 min (initial dehaturation) 

Step 2 65 °C fori min 

Step 3 68 °C for 6 min 

Step 4 94 °C. for 15 sec 

Step 5 65 °C. for 1 min 

25 Step 6 68°Cfor7min 

Step 7 Repeat step 4-6 for 15 additional cycles 

Step 8 94°Cfor l5sec 

Step 9 65 °C fori min 

Step 10 68 °C for 7:15 min 

30 Step 1 1 Repeat step 8-10 for 1 2 cycles 

Step 12 72 °C for 8 min 

Step 13 4°C. (and holding) 
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A 5-10 ul aliquot of the reaction mixture is analyzed by electrophoresis on 
a low concentration (about 0. 6-0. 8%) agarose mini-gel to determine which 
reactions were successful in extending the sequence. Bands thought to contain 
the largest products are selected and removed from the gel. Further purification 
5 involves using a commercial gel extraction method such as the QIA QUICK kit 
(QIAGEN Inc. , Chatsworth, Calif.). After recovery of the DNA, Klenow 
enzyme is used to trim single-stranded, nucleotide overhangs creating blunt ends 
which facilitate religation and cloning. 

After ethanol precipitation, the products are redissolved in 13 ul of 

10 ligation buffer, 1 ul T4-DNA ligase (15 units) and 1 ul T4 polynucleotide kinase 
are added, and the mixture is incubated at room temperature for 2-3 hours or 
overnight at 16 ° C Competent E. coli cells (in 40 ul of appropriate media) are 
transformed with 3 ul of ligation mixture and cultured in 80 ul of SOC medium 
(Sambrook et al. , supra). After incubation for one hour at 37 ° C. , the whole 

15 transformation mixture is plated on Luria Bertani (LB)-agar (Sambrook et al, 
supra) containing 2. times. Carb. The following day, several colonies are 
randomly picked from each plate and cultured in 1 50 ul of liquid LB/2, times. 
Carb medium placed in an individual well of an appropriate, commercially- 
2vailable, sterile 96-well microtiter plate. The following day, 5 ul of each 

20 overnight culture is transferred into a non-sterile 96-well plate and after dilution 
1:10 with water, 5 ul of each sample is transferred into a PCR array. 

For PCR amplification, 18 ul of concentrated PCR reaction mix (3. 3. 
times. ) containing 4 units of rTth DNA polymerase, a vector primer, and one or 
both of the gene specific primers used for the extension reaction are added to 

25 each well. Amplification is performed using the following conditions: 



Stepl 94°Cfor60sec 

Step 2 94 °C. for 20 sec 

Step 3 55 °C for 30 sec 

30 Step 4 72 °C for 90 sec 

Step 5 Repeat steps 2-4 for an additional 29 cycles 

Step 6 72 °C. for 180 sec 

Step 7 4°C. (and holding) 
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Aliquots of the PCR reactions arc run on agarose gels together with 
molecular weight markers. The sizes of the PCR products are compared to the 
original partial cDNAs, and appropriate clones are selected, ligated into plasmid, 
and sequenced. 

5 

15. Labeling and Use of Hybridization Probes 

Hybridization probes derived from Figure 1, Figure 3, Figure 5, Figure 7, 
Figure 8 or Figure 10 are employed to screen cDNAs, genomic DNAs, or 
mRNAs. Although the labeling of oligonucleotides, consisting of about 20 base- 

10 pairs, is specifically described, essentially the same procedure is used with larger 
cDNA fragments. Oligonucleotides are designed using state-of-the-art software 
such as OLIGO 4. 06 (National Biosciences), labeled by combining 50 pmol of 
each oligomer and 250 \iCi of 32 P adenosine triphosphate (Amersham Pharmacia 
Biotech) and T4 polynucleotide kinase (DuPont NEN, Boston, Mass. ). The 

15 labeled oligonucleotides are substantially purified with SEPHADEX G-25 
superfine resin column (Pharmacia & Upjohn). A portion containing 10 7 counts 
per minute of each of the sense and antisense oligonucleotides is used in a typical 
membrane based hybridization analysis of human genomic DNA digested with 
one of the following endonucleases (Ase I, Bgl II, Eco RI, Pst I, Xba 1, or Pvu 

20 II; DuPont NEN). 

The DNA from each digest is fractionated on a 0. 7 percent agarose gel 
and transferred to nylon membranes (Nytran Plus, Schleicher & Schuell, 
Durham, N. H. ). Hybridization is carried out for 16 hours at 40°C. To remove 
nonspecific signals, blots are sequentially washed at room temperature under 

25 increasingly stringent conditions up to 0. 1. times, saline sodium citrate and 0. 5% 
sodium dodecyl sulfate. After XOMAT AR film Eastman Kodak Rochester, N. 
Y. ) is exposed to the blots in a Phosphoimager cassette (Molecular Dynamics, 
Sunnyvale, Calif) for several hours, hybridization patterns are compared visually. 

30 16. Antisense or Complementary Sequences 

Antisense molecules or nucleic acid sequences complementary to the 
HOMPS-encoding sequence, or any part thereof, are used to inhibit in vivo or in 
vitro expression of naturally occurring HOMPS. Although use of antisense 
oligonucleotides, comprising about 20 base-pairs, is specifically described, 
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essentially the same procedure is used with larger cDNA fragments. An 
oligonucleotide based on the coding sequences of HOMPS, is used to inhibit 
expression of naturally occurring HOMPS. The complementary oligonucleotide 
is designed from the most unique 5' sequence and used either to inhibit 
5 transcription by preventing promoter binding to the upstream nontranslated 
sequence or translation of an HOMPS-encoding transcript by preventing the 
ribosome from binding. Using an appropriate portion of the signal and 5 f 
sequence of Figure 1, Figure 3, Figure 5, Figure 8 or Figure 10, an effective 
antisense oligonucleotide includes any 15-20 nucleotides spanning the region 
10 which translates into the signal or 5' coding sequence of the polypeptide. 

17. Expression of HOMPS 

Expression of HOMPS is accomplished by subcloning the cDNAs into 
appropriate vectors and transforming the vectors into host cells. In this case, the 

15 cloning vector, pINCYl, previously used for the generation of the cDNA library 
is used to express HOMPS in E. coli. Upstream of the cloning site, this vector 
contains a promoter for P-galactosidase, followed by sequence containing the 
ammo-terminal Met, and the subsequent seven residues of P-galactosidase. 
Irnmediately following these eight residues is a bacteriophage promoter useful for 

20 transcription and a linker containing a number of unique restriction sites. 

Induction of an isolated, transformed bacterial strain with IPTG using 
standard methods produces a fusion protein which consists of the first eight 
residues of p-galactosidase, about 5 to 15 residues of linker, and the full length 
protein. The signal residues direct the secretion of HOMPS into the bacterial 

25 growth media which can be used directly in the following assay for activity. 

18. Production of HOMPS Specific Antibodies 

HOMPS that is substantially purified using PAGE electrophoresis 
(Sambrook, supra), or other purification techniques, is used to immunize rabbits 
30 and to produce antibodies using standard protocols. The amino acid sequence 
deduced from Figure 1, Figure 3, Figure 5, Figure 8 or Figure 10 is analyzed using 
LASERGENE software (DNASTAR Inc) to determine regions of high 
lrnmunogenicity and a corresponding oligopolypeptide is synthesized and used to 
raise antibodies by means known to those of skill in the art. Selection of 
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appropriate epitopes, such as those near the C-tetminus or in hydrophilic regions, 
is described by Ausubel et al. (supra), and others. 

Typically, the oligopeptides are 1 5 residues in length, synthesized using an 
Applied Biosys terns Peptide Synthesizer Model 431 A using fmoc-chemistry, and 
5 coupled to keyhole limpet hemocyanin (KLH, Sigma, St. Louis, Mo. ) by reaction 
with N-maleimidobenzoyl-N-hyckoxysuccinirnide ester (MBS; Ausubel et al. , 
supra). 

Rabbits are immunized with the oligopeptide-KLH complex in complete 
Freund's adjuvant. The resulting antisera are tested for antipeptide activity, for 
10 example, by binding the peptide to plastic, blocking with 1% BSA, reacting with 
rabbit antisera, washing, and reacting with radioiodinated, goat anti-rabbit IgG. 

19. Purification of Naturally Occurring HOMPS Using Specific 
Antibodies 

15 Naturally occurring or recombinant HOMPS is substantially purified by 

immunoafrinity chromatography using antibodies specific for HOMPS. An 
immunoaffinity column is constructed by covalendy coupling HOMPS antibody 
to an activated chromatographic resin, such as CnBr-activated SEPHAROSE 
(Amersham Pharmacia Biotech). After the coupling, the resin is blocked and 

20 washed according to the manufacturer's instructions. 

Media containing HOMPS is passed over the immunoaffinity column, 
and the column is washed under conditions that allow the preferential 
absorb ance of HOMPS (e. g. , high ionic strength buffers in the presence of 
detergent). The column is eluted under conditions that disrupt antibody /HOMPS 

25 binding (e.g., a buffer of pH 2-3 or a high concentration of a chaotrope, such as 
urea or thiocyanate ion), and HOMPS is collected. 

20. Identification of Molecules Which Interact with HOMPS 

HOMPS or biologically active fragments thereof are labeled with 125 I 
30 Bolton-Hunter reagent (Bolton et al. (1973) Biochem. J. 133: 529). Candidate 
molecules previously arrayed in the wells of a multi-well plate are incubated with 
the labeled HOMPS, washed and any wells with labeled HOMPS complex are 
assayed. Data obtained using different concentrations of HOMPS are used to 
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calculate values for the number, affinity, and association of HOMPS with the 
candidate molecules. 

All publications and patents mentioned in the specification are herein 
5 incorporated by reference. Various modifications and variations of the described 
method and system of the invention will be apparent to those skilled in the art 
without departing from the scope and spirit of the invention. Although the 
invention has been described in connection with specific preferred embodiments, 
it should be understood that the invention as claimed should not be unduly 
10 limited to such specific embodiments. Indeed, various modifications of the 
described modes for carrying out the invention which are obvious to those skilled 
in molecular biology or related fields are intended to be within the scope of the 
following claims. 
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CLAIMS 

I . An isolated polynucleotide that encodes a polypeptide selected from the 
group consisting of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, 
SEQ ID NO: 9 and SEQ ID NO: 11. 

5 2. An isolated polypeptide comprising the amino acid sequence selected 
from the group consisting of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID 
NO: 6, SEQ ID NO: 9 and SEQ ID NO: 11. 

3. An isolated variant of the polynucleotide selected from the group 
consisting of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID 

10 NO: 7, SEQ ID NO: 8 and SEQ ID NO: 10. 

4. An isolated variant of the polypeptide selected from the group consisting 
of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 9 and 
SEQ ID NO: 11. 

5. An isolated polynucleotide that hybridizes to a sequence selected from 
15 the group consisting of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, 

SEQ ID NO: 7, SEQ ID NO: 8 and SEQ ID NO: 10 under highly 
stringent conditions. 

6. A polynucleotide sequence which is the complement of a polynucleotide 
of claim 1. 

20 7. A composition comprising a polynucleotide of claim 1. 

8. A composition comprising a polypeptide of claim 2. 

9. An expression vector containing a polynucleotide of claim 1, 

10. A host cell containing the vector of claim 9. 

II. A method for producing a polypeptide comprising the amino acid 
25 sequence selected from the group consisting of SEQ ID NO: 2, SEQ ID 

NO: 4, SEQ ID NO: 6, SEQ ID NO: 9 and SEQ ID NO: 11 comprising 
the steps of: a) culturing the host cell of claim 10 under conditions 
suitable for the expression of the polypeptide; and b) recovering the 
polypeptide from the host cell culture. 
30 12. An isolated antibody capable of binding to an epitope of a polypeptide 
set forth in SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID 
NO: 9 and SEQ ID NO: 11. 
13. The antibody claim 12, wherein the antibody is monoclonal. 
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I--. A recombinant protein comprising the antigen binding region of a 
monoclonal antibody of claim 13. 

15. The antibody or fragment thereof of claim 13 or 14, or the recombinant 
protein of claim 13, which is labeled with a detectable marker. 
5 16. The antibody or fragment thereof or recombinant protein of claim 15, 
wherein the detectable marker is selected from the group consisting of a 
radioisotope, fluorescent compound, bioluminescent compound, 
chemiluminescent compound, metal chelator or enzyme. 

17. The antibody fragment of claim 13, which is an Fab, F(ab')2, Fv or Sfv 
10 fragment. 

18. A polynucleotide of claim 1 that is labeled with a detectable marker. 

19. An assay for detecting the presence of a polypeptide set forth in SEQ ID 
NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 9 and SEQ ID 
NO: 1 1 in a biological sample comprising contacting the sample with an 

15 antibody of claim 12, and detecting the binding of the polypeptide in the 

sample thereto. 

20. An assay for detecting the presence of a polynucleotide as shown in SEQ 
ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID 
NO: 8 and SEQ ID NO: 10 in a biological sample comprising contacting 

20 the sample with a complementary polynucleotide of claim 6, and 

detecting the binding of the complementary polynucleotide in the sample 
thereto. 
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1 


GGCACGAGCT 


GCGATAATAG 


CGAGGCAGCA 


GTGCAGCTTT 


CAGAGGGTCC 


51 


GGGCTCAGAG 


GGGCTATGAT 


TCGGAGGGTT 


CTGCCGCACG 


GCATGGGCCG 


101 


GGGCCTCTTG 


ACCCGGAGGC 


CAGGCACGCG 


CAGAGGAGGC 


TTTTCTCTGG 


151 


ACTGGGATGG 


AAAGGTGTCT 


GAGATTAAGA 


AGAAGATCAA 


GTCGATCCTG 


201 


CCTGGAAGGT 


CCTGTGATCT 


ACTGCAAGAC 


ACCAGCCACC 


TGCCTCCCGA 


251 


GCACTCGGAT 


GTGGTGATCG 


TGGGAGGTGG 


GGTGCTTGGC 


TTGTCTGTGG 


301 


CCTATTGGCT 


GAAGAAGCTG 


GAGAGCAGAC 


GAGGTGCTAT 


TCGAGTGCTA 


351 


GTGGTGGAAC 


GGGACCACAC 


GTATTCACAG 


GCCTCCACCG 


GGCTCTCAGT 


401 


AGGTGGGATT 


TGTCAGCAGT 


TCTCATTGCC 


TGAGAACATC 


CAGCTCTCCC 


451 


TCTTTTCAGC 


CAGCTTTCTA 


CGGAACATCA 


ATGAGTACCT 


GGCCGTAGTC 


501 


GATGCTCCTC 


CCCTGGACCT 


CCGGTTCAAC 


CCCTCGGGCT 


ACCTCTTGCT 


551 


GGCTTCAGAA 


AAGGATGCTG 


CAGCCATGGA 


GAGCAACGTG 


AAAGTGCAGA 


601 


GGCAGGAGGG 


AGCCAAAGTT 


TCTCTGATGT 


CTCCTGATCA 


GCTTCGGAAC 


651 


AAGTTTCCCT 


GGATAAACAC 


AGAGGGAGTG 


GCTTTGGCGT 


CTTATGGGAT 


701 


GGAGGACGAA 


GGTTGGTTTG 


ACCCCTGGTG 


TCTGCTCCAG 


GGGCTTCGGC 


751 


GAAAGGTCCA 


GTCCTTGGGA 


GTCCTTTTCT 


GCCAGGGAGA 


GGTGACACGT 


801 


TTTGTCTCTT 


CATCTCAACG 


CATGTTGACC 


ACAGATGACA 


AAGCGGTGGT 


851 


CTTGAAAAGG 


ATCCATGAAG 


TCCATGTGAA 


GATGGACCGC 


AGCCTGGAGT 


901 


ACCAGCCTGT 


GGAATGCGCC 


ATTGTGATCA 


ACGCAGCCGG 


AGCCTGGTCT 


951 


GCGCAAATCG 


CAGCACTGGC 


TGGTGTTGGA 


GAGGGGCCGC 


CTGGCACCCT 


1001 


GCAGGGCACC 


AAGCTACCTG 


TGGAGCCGAG 


GAAAAGGTAT 


GTGTATGTGT 


1051 


GGCACTGCCC 


CCAGGGACCA 


GGCCTAGAGA 


CTCCGCTTGT 


TGCAGACACC 


1101 


AGTGGAGCCT 


ATTTTCGCCG 


GGAAGGATTA 


GGTAGCAACT 


ACCTAGGTGG 


1151 


TCGTAGCCCC 


ACTGAGCAGG 


AAGAACCGGA 


CCCGGCGAAC 


CTGGAAGTGG 


1201 


ACCATGATTT 


CTTCCAGGAC 


AAGGTGTGGC 


CCCATTTGGC 


CCTGAGGGTC 


1251 


CCAGCTTTTG 


AGACTCTGAA 


GGTTCAGAGC 


GCCTGGGCCG 


GCTATTACGA 


1301 


CTACAACACC 


TTTGACCAGA 


ATGGCGTGGT 


GGGCCCCCAC 


CCGCTAGTTG 


1351 


TCAACATGTA 


CTTTGCTACT 


GGCTTCAGTG 


GTCACGGGCT 


CCAGCAGGCC 


1401 


CCTGGCATTG 


GGCGAGCTGT 


AGCAGAGATG 


GTACTGAAGG 


GCAGGTTCCA 


1451 


GACCATCGAC 


CTGAGCCCCT 


TCCTCTTTAC 


CCGCTTTTAC 


TTGGGAGAGA 



FIG. 1A 
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1501 AGATCCAGGA GAACAACATC ATCTGAGCAT GTGTGCTCTG CACTGGCTCC 

1551 ACTGGCTTGC ATCCTGGCTG TGTTCACAGC CTTGTTTGCT GCTTCCATCT 

. 1601 TCCCCAGTAC TGTGCCAGGC CTTCTCCCCC TGCCCAGTGT CCTCTCCTCT 

1651 CAGGCAGGCC ATTGCACCCA TATGGCTGGG CAGGCACAGG CAGTGAGGCC 

1701 GAGGCCAATA GCGAGTGATG AGCGGGATCC TAGGACTGAT CTGTAGGCCA 

1751 TGCTGATGTC ACCCACCAGG GCAATCCATC TGGAGGCCTG AGCACCCTGG 

1801 CCCAGGACTG GCTTCATCCT GGCACTGACC AGGAAAGACT GCCTCTGACC 

1851 CTCTTAGCAG ACAGAGCCCA GGCATGGGAG CACTCTGGGG CAGCCTGGCT. 

1901 CAGGTTTATT GATTTTCGTC TGTTTACCCT ATCCATTAAT CAATACATGT 

1951 AATTAACTCC TAAAAAAAAA AAAAAAAAAA A 

FIG. 1B 
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1 MIRRVLPHGM GRGLLTRRPG TRRGGFSLDW DGKVSEIKKK IKSILPGRSC 

51 DLLQDTSHLP PEHSDWIVG GGVLGLSVAY WLKKLESRRG AIRVLWERD 

101 HTYSQASTGL SVGGICQQFS LPENIQLSLF SASFLRNINE YLAWDAPPL 

151 DLRFNPSGYL LLASEKDAAA MESNVKVQRQ EGAKVSLMSP DQLRNKFPWI 

201 NTEGVALASY GMEDEGWFDP WCLLQGLRRK VQSLGVLFCQ GEVTRFVSSS 

251 QRMLTTDDKA WLKRIHEVH VKMDRSLEYQ PVECAIVINA AGAWSAQIAA 

301 LAGVGEGPPG TLQGTKLPVE PRKRYVYVWH CPQGPGLETP LVADTSGAYF 

351 RREGLGSNYL GGRSPTEQEE PDPANLEVDH DFFQDKVWPH LALRVPAFET 

401 LKVQSAWAGY YDYNTFDQNG WGPHPLWN MYFATGFSGH GLQQAPGIGR 

451 AVAEMVLKGR FQTIDLSPFL FTRFYLGEKI QENNII 

FIG. 2 
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1 GGCACGAGCG GGGACGGAGC GAGCCGGCGC CAGGGCCCCT CGGGCCGGGA 

51 AGAGGGGAAG GGGAGCGAGG TTGATGCCCG GCGGAGGGGC GAGCGCGGCG 

101 TCTGGCCGGC TTCTCACCGC CGCGGAGCAA AGAGGGTCCC GGGAAGCGGC 

151 AGGGTCGGCG TCCAGGAGCG GCTTCGGGGG CTCCGGCGGC GGCAGAGGCG 

201 GAGCAAGCGG CCCCGGGTCC GGGAGCGGAG GCCCTGGGGG CCCCGCGGGC 

251 AGGATGAGCT TGACCCCGAA GGAGCTCTCG AGCCTGCTGA GCATCATATC 

.301 GGAGGAGGCG GGCGGCGGCA GCACCTTCGA GGGCCTGTCC ACCGCCTTCC 

351 ACCACTACTT CAGCAAGGCC. GACCACTTCC GCCTGGGCTC GGTGCTCGTC 

401 ATGCTGCTCC AGCAGCCCGA CCTGCTGCCT AGCGCGGCGC AGCGCCTCAC 

451 GGCGCTCTAC CTGCTCTGGG AGATGTACCG CACCGAGCCG CTGGCCGCCA 

501 ACCCCTTCGC CGCCAGCT.TC GCGCACCTGC TCAACCCCGC GCCGCCCGCC 

551 CGCGGCGGCC AGGAACCCGA CCGCCCTCCG CTCTCAGGAT TTTTACCTCC 

601 TATAACTCCA CCAGAAAAGT TTTTTCTTTC CCAGCTGATG CTGGCACCCC 

651 CACGGGAACT CTTCAAAAAG ACGCCTCGCC AGATTGCACT GATGGACGTT 

701 GGAAACATGG GCCAGTCTGT GGACATTAGT GGGCTTCAGT TAGCCTTGGC 

751 CGAACGCCAA TCTGAATTGC CAACGCAAAG CAAAGCGAGC TTCCCCAGTA 

801 TTCTCAGTGA CCCAGACCCG GATTCTTCTA ATTCTGGATT TGACAGCTCA 

851 GTTGCCTCTC AGATCACAGA AGCTTTAGTC AGCGGACCAA AGCCACCTAT 

901 TGAAAGCCAT TTTCGACCAG AGTTTATTCG TCCACCGCCT CCACTCCACA 

951 TTTGTGAGGA TGAACTTGCT TGGCTAAACC CCACGGAGCC TGACCACGCG 

1001 ATCCAGTGGG ATAAATCGAT GTGTGTTAAG AATAGCACTG GTGTGGAGAT 

1051 CAAACGAATA ATGGCCAAAG CCTTCAAAAG CCCCTTATCC TCTCCCCAAC 

1101 AAACACAGCT ACTTGGTGAG TTGGAAAAAG ACCCCAAACT TGTCTACCAT 

1151 ATTGGCCTCA CCCCAGCCAA ACTTCCTGAC CTTGTGGAAA ACAACCCTTT 

1201 AGTCGCTATA GAAATGTTGC TGAAATTAAT GCAGTCAAGC CAGATCACTG 

1251 AGTATTTCTC TGTCCTGGTC AATATGGACA TGTCTTTACA TTCAATGGAA 

1301 GTTGTAAATC GACTAACTAC AGCTGTTGAT CTACCTCCTG AATTTATTCA 

1351 CCTTTATATA TCAAATTGCA TCTCTACTTG TGAACAGATT AAGGATAAAT 

1401 ATATGCAGAA TCGGTTGGTG CGTCTTGTGT GTGTGTTTCT CCAATCCTTG 

1451 ATCCGTAACA AAATTATTAA TGTACAGGAT TTGTTTATAG AAGTGCAGGC 

1501 ATTCTGTATT GAATTCAGTA GGATACGAGA AGCTGCTGGT CTTTTCCGGT 

FIG.3A 
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1551 TGTTGAAGAC ATTGGATACT GGGGAAACAC CTTCTGAGAC CAAAATGTCA 

1601 AAATAATACC TCATCAGAAC CATCCCATCC ATTCACTGTT CAGCTGTACT 

1651 GTGATTTAGT TTTTACACCG TTAAAACCCT GAGTGGATTG CTTGGTTTAA 

1701 TGCATATAAA CAGTACTTTA TCTACTTAAA GCAAAAAAAA AAAAAAAAAA 

FIG.3B 
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1 MPGGGASAAS GRLLTMEQR GSREAAGSAS RSGFGGSGGG RGGASGPGSG 

51 SGGPGGPAGR MSLTPKELSS LLSIISEEAG GGSTFEGLST AFHHYFSKAD 

101 HFRLGSVLVM LLQQPDLLPS AAQRLTALYL LWEMYRTEPL AANPFAASFA 

151 HLLNPAPPAR GGQEPDRPPL SGFLPPITPP EKFFLSQLML APPRELFKKT 

201 PRQIALMDVG NMGQSVDISG LQLALAERQS ELPTQSKASF PSILSDPDPD 

251 SSNSGFDSSV ASQITEALVS GPKPPIESHF RPEFIRPPPP LHICEDELAW 

301 LNPTEPDHAI QWDKSMCVKN STGVEIKRIM AKAFKSPLSS PQQTQLLGEL 

351 EKDPKLVYHI GLTPAKLPDL VENNPLVAIE MLLKLMQSSQ ITEYFSVLVN 

401 MDMSLHSMEV VNRLTTAVDL PPEFIHLYIS NCISTCEQIK DKYMQNRLVR 

451 LVCVFLQSLI RNKIINVQDL FIEVQAFCIE FSRIREAAGL FRLLKTLDTG 

501 ETPSETKMSK 

FIG. 4 
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1 GGCACGAGCT GGCTCGCGCG TGCCTTTTCC CCTCAGGTTG TGGGGAGAGC 

51 GGAATCCTGC TCCGCCGTCG CAGCAGCAGC GGCAGCCCCG GCAGCCTCGG 

101 GCGACAGCGG CGGCGCGCGA GCCCCCGGGC GGACCGTACC ACCGCTCGCC 

151 AGCACGCAGG GGGAGCCGCC CGTCTCGCCG CGCACGCCTC GGCGACCCCG 

201 CGGGGCTGAG GCGTCGCCGC GCCCGGCAGC- GTGAGCGCAG AGCCGGCCTC 

251 GACCCCGAGC TCGGAGCCCC GCGGGCCGCG CCCGCCGCCG GCCCCACCCA 

301 TCCGGGTCGA GGAGGCCGAG GCCATGGCTG AGACGGAGGA GCGGAGCCTG 

351 GACAACTTCT TTGCCAAGAG GGACAAGAAG AAGAAGAAGG AGCGGAGCAA 

401 CCGGGCGGCG AGTGCCGCGG GCGCAGCGGG CAGCGCCGGC GGAAGCAGTG 

451 GAGCCGCGGG TGCGGCGGGC GGCGGGGCGG GCGCGGGGAC CCGGCCGGGT 

501 GACGGCGGGA CCGCCAGCGC GGGGGCTGCG GGCCCAGGGG CCGCCACCAA 

551 GGCTGTGACG AAGGACGAAG ATGAATGGAA AGAATTGGAG CAAAAAGAGG 

601 TTGATTACAG CGGCCTCAGG GTTCAGGCAA TGCAAATAAG CAGTGAAAAG 

651 GAAGAAGACG ATAATGAAAA GAGACAAGAT CCAGGTGATA ACTGGGAAGA 

701 AGGTGGAGGT GGTGGTGGAG GTATGGAAAA ATCTTCAGGT CCCTGGAATA 

751 AAACAGCTCC AGTACAAGCA CCTCCTGCTC CAGTAATTGT TACAGAAACC 

• 801 CCAGAACCAG CGATGACTAG TGGTGTGTAT AGGCCTCCTG GGGCCAGGTT 

851 MCCACAACA AGGAAAACAC CACAAGGACC ACCAGAAATC TACAGTGATA 

901 CACAGTTCCC ATCCCTGCAG TCAACTGCCA AGCATGTAGA AAGCCGGAAG 

951 GATAAAGAAA TGGAGAAGAG CTTTGAAGTA GTAAGACACA AAAATAGAGG 

1001 TAGGGATGAG GTTTCAAAAA ACCAGGCCCT TAAACTTCAG CTAGACAACC 

1051 AATATGCTGT GCTTGAAAAT CAGAAAAGCA GCCACTCACA ATACAATTAA 

1101 GGAATGGGCT TTGCTAACCC TTCTGAGGTA ACTAGACTGC AGCTAACCAC 

1151 CACCAACAGC CATTCATCAT CTGATCTCTG CTGGATCTAC AGACACCGAT 

1201 GCAGACCACT CGATTTCATG ACCGGCCCTA TTGCACTATG GAAGTTAAAG 

1251 TGTCACGACT GCTCTATGCA TATTGGATTT AGGGGAATTT TCATTGTTAC 

1301 ATAAATGTGT GAACTAGTTT CAACAGTGTT CTTTCATATT TACTCTGCAA 

1351 ATACAAAAAA CCAAAACCTG CAGCCAGTGG TCATTTCAAA ATCTTTTTAT 

1401 GTTCAGATAC TGAGCCTTCA TAAGGGTTGA CTACCTCAGA TTTGCTGCAC 

1451 TCATTGTGGA CTTCATGTGG ATCACAACTT CTGGATAAGA AGATTACAAC 

1501 TATTAAGTGT CGATGTGAAC CTTGCAACCA GCTCTACTGG ATTCTTATCA 

FIG. 5A 
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1551 GAAATCCTGC ATAAAAAGTC AGCCATCTGG GTTCTGATCT GCTGTAAAAG 

1601 ATGAAGATTT AAGTGACCTT AATTAACCTG TCCTGTGCCC TACCCTTAAG 

1651 GAATACTCTC TGTAGTAGGC TGTTGTTATA TTAGACTTCC TGGAACACAC 

1701 CGCTGAAAAG AACTGATGTG TTCAGATCAT CTGTGTAGGG CTGTGATTTG 

1751 TAATTTAAAC TAATTGTATT CTGAGGTAAC CACAAAATAA ATTCAACCAA 

1801 ACTGGGGTCC ACCAAGTGGG GGAAGGGGAA GGGAGAGAAT AATCTTGGGG 

1851 GTTTTTTTTT TTGGTAATTT TTTTATTTGG ATAGTGCTTT TTTGTTTTGT 

1901 TTTTGTTCTG CATTAAGGCC TTTTTTGCTT TGACTTGAM TAAGTTCTTT 

1951 GACAGAGCAT ATTGCTTGGT TAATTAAGTA ACCTAAAGTA TGCATTAGGA 

2001 TTGTGAAATG TCTCGTGAGT ATGCCAATCC TGAGGGTGGA ACCAAATAGC 

2051 GTTTGATGAA AAGGGCAGTG GATTCTGGAG GCTCTACTTC AGGTGCTGCT 

2101 ATAATGCCTC ATCTAATCAG GACTAAATTG TGTAGGAAAC TGCAGTGGGA 

2151 AGAATATGCT TTCTGCTCAG GCTAAGAGGG TCACTGATCT GTCCTTAGAA 

2201 ATTCAGAGTA ACATGAGCAA AACCTCAGCT AAAACCCATT TAAGTGGCAT 

2251 GGATTGTGCA TGATCTTTGA TAAGMTTCC TCATGTACTT GTGCCTAGTT 

2301 TTTCAAGGTA TTGGCTGTTC TATAGATGCA GTGATTGTCC CAGCTAGCTC 

2351 TGTTACCAGC CTTTTGGTGT GTCTTTATGT TCATTTGGAG AGTCAGGGCG 

2401 AAAGACAGGT GATGTAGCAC TTCTGTTTTT AATAATTATT GCTTAAAATA 

2451 CCTATTAATA GTTTTGGGTC ATTTAAAGGG ACTTGAGGAA GCTACCCAGG 

2501 ATTACAGAAG AGTGTGCACC TAACAAGATG GTCTGGCAGT TTCCTAGTTT 

2551 TGTATCTGGT TCAATAGAAA TATGTGAAAG TGGTAATGTC ATCATTTGAT 

2601 GCAGAGTCCG GGTTTCTCTA TAATAAATCC CTTTGCCAAA . TGCATGAGTT 

2651 GCAGACTTGC TACTGGCAAG AGTGAAGCAA GTGGGTGAGT AAAACTATTT 

2701 TGACGTGGGA GCGTTTTCAG ATAGGAGTTT AGTCTTGACG AAAGTGTCCG 

2751 TGCAGGAATT GGACTCCGAG GAGGGTTACA GTATCTCCTG ACGGGACCTG 

2801 CCACTCGCAT CTGGGCAATG TTGACATTTG AGGTGGCAGG CAGGATGCCT 

2851 GCCTTCTAAT ATATTTGGGT GAGTAACTGA GGCAGCCAAG GGAAGGTTGA 

2901 ATGATTAAAT CAGAAATGGG ATTCTTGGTA AACTGAAGAC TTTTATTTGG 

2951 GAATGAAAAA CCTTAAAAM ATCTCTTCAT CGTTGAACTG TGCATTTTCC 

3001 CTGCATTTTT TCCCAACAAA ATTTTGTTGG GGGTTATGTT ACTGAAGAAT 

.3051 GAACAGATGA GTAAGTGGAG GTGTTATGTA AAGGCATATT GTACTCGAAA 

3101 TCTGAAGACC TGCAGCAGAT TTAAATTACA ACTCTTGTTA TAACTTTTTA 

3151 AAAGATTGTG AAAATATCAA AATATAAATG AATCAAGTTT TAATATACTG 

3201 TATGATGGGT GGATGAGGCT GTCCATTGTA CCATTTGTTT GAATTCTCAG 

3251 GCATGGTTTG GCAGTGCAAG AATTCTGTAA CATTAACAAA TTCAATAAAA 

3301 AGTAAATATA TGGAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAA 

FIG.5B 
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1 MAETEERSLD NFFAKRDKKK KKERSNRAAS MGAAGSAGG SSGAAGAAGG 

51 GAGAGTRPGD GGTASAGAAG PGAATKAVTK DEDEWKELEQ KEVDYSGLRV 

101 QAMQISSEKE EDDNEKRQDP GDNWEEGGGG GGGMEKSSGP WNKTAPVQAP 

151 PAPVIVTETP EPAMTSGVYR PPGARLTTTR KTPQGPPEIY SDTQFPSLQS 

201 TAKHVESRKD KEMEKSFEW RHKNRGRDEV SKNQALKLQL DNQYAVLENQ 

251 KSSHSQYN 

FIG. 6 
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GGCAC GAGGTGGCAT AGCATAACCA CAGTAAGAAC 

AGAACAGATA TTCAGCAGAA AACTTTTTAT ACTCTAATTC TTTTTTTTTT 
TTTTTTTGAG ACAGAGTTTT AGTCTTGTTT CCCAGGCTGG AGTGCAATGG 
CACAATCTTG GCTCACTGCA ACCTCCGCCT CCTGGGTTCA GGCAATTTTC 
CTGCCTCAGC CTCCCAAGTA GCTGGGATTA CAGGCACCCA CCACCATGCC 
CAGCTAATTT TTGTATTTTT AATAGAGAGC TAATAATTGT ATATTTAATA 
AAGACGGGTT TCACCATGTT GGCCAGGCTG GTCTTGAACT CCTGACCTCA 
GGTGATCCTC CTGCATTGGC CTCCCAAAGT GCTGGAATTC CAGGCATGAG 
CCACTGCGCC CAGTCTACAC ACTAATTCTT GTTAGCCCAA CAGCTGTTCT 
GTTCTATCTA CCCCTCATTT CACGCTCAAG GAGTCATACC TAGAATAGTT 
ACACACAAGA GGGAAACTGG AAGCCAAACA CTGTACAGTA TTGTGTAGAA 
AGTCACCTCC CTACTCCTTT TATTTTACAT GAGTGCTGAT GTGTTTTGGC 
AGATGAGCTT TCAGCTGAGG CCTGATGGAA ATTGAGATAA CCTGCAAAGA 
CATAACAGTA TTTATGAGTT ATATCTTAGT TCTTGAAATT GTGGAATGCA 
TGATTGACAA TATATTTTTA ATTTTTATTT TTTCAAGTAA TACCAGTACT 
GTTTAACTAT AGCCAGAACT GGCTAAAATT TTTATATTTT CAGAGTTGAA 
GTTGGTGAAG ACATTCATGA TTTAAACACC AGATCCTGAA AGGGGTTAAA 
TCTACTTTGA AATGAATCTG CAATCAGTAT TTCAAAGCTT TTCTGGTAAT 
TTTAGTGATC TTATTTGATT AGACTTTTTC AGAAGTACTA AATAAGGAAT 
TTTAACAGGT TTTTATTAAT GCACAGATAA ATAGAAGTAC AGTGAGGTCT . 
ATAGCCATTT TATTAAAAtA GCTTAAAAGT TTGTAAAAAA ATGAATCTTT 
GTAATTACTT AATATGTTAG TTAAGAACCC GTCAAGCTTA TATTTGCTAG 
ACTTACAAAT TATTTTAAAT GCATTTATCT TTTTTGACAC TATTCAGTGG 
AATGTGTAAG CTAGCTAATT CTTGTTTTCT GATTTAAAGC ACTTTTAAAT 
CTTATCCTGC CCCCTAAAAA CAAAAGGTTT TGATCACAAG GGGAAATTTA 
AGATTGTTAA CCCTGTTTTT CAGAAGGGCT ACTGTTAATT GCACATAAAC 
ATGAAATGTG TTTTCCCCTG TGTACTAACA CATTCTAGGC AAAATTCAAA 
CTTATAGTGG TAAAGAAACA GGTTGTTCAC TTGCTGAGGT GCAAAAATTC 
TTAAGACTTC TGTTTGAAAT TGCTCAATGA CTAGGAAAAG ATGTAGTAGT 
TTACTAAAAT TGTTTTTCTA CCATATCAAA TTAAACAATT CATGCCTTTT 
TAGGGTCAGG CCTACAATGA ATAGGTATGG TGGTTTCACA GAATTTTAAA 
ATAGAGTTAA AGGGAAGTGA TGTACATTTC GGGGGCATTA GGGTAGGGAG 
ATGAATCAAA AAATACCCCT AGTAATGCTT TATATTTTAA TACTGCAAAA 
GCTTTACAAA TGGAAACCAT GCAATTACCT GCCTTAGTTC TTTTGTCATA 
AAAACAATCA CTTGGTTGGT TGTATTGTAG CTATTACTTA TACAGCAACA 
TTTCTTCAAT TAGCAGTCTA GACATTTTAT AAACAGAAAT CTTGGACCAA 
TTGATAATAT TTCTGACTGT ATTAATATTT TAGTGCTATA AAATACTATG 
TGAATCTCTT AAAAATCTGA CATTTTACAG TCTGTATTAG ACATACTGTT 
TTTATAATGT TTTACTTCTG CCTTAAGATT TAGGTTTTTT AAATGTATTT 
TTGCCCTGAA TTAAGTGTTA ATTTGATGGA AACTCTGCTT TTAAAATCAT 
CATTTACTGG GTTCTMTAA ATTAAAAATT AAACTTGTAA AAAAAAAAAA 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAA 

FIG. 7 
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1 GGCACGAGGT CCCACCTTGT GGAGGATGGA GGTGACCGGG GACGCCGGGG 



51 


TACCAGAATC 


TGGCGAGATC 


CGGACTCTAA 


AGCCGTGTCT 


GCTGCGCCGC 


101 


AACTACAGCC 


GCGAACAGCA 


CGGCGTGGCC 


GCCTCCTGCC 


TCGAAGACCT 


151 


GAGGAGCAAG 


GCCTGTGAGA 


TTCTGGCCAT 


TGATAAGTCC 


CTGACACCAG 


201 


TCACCCTGGT 


CCTGGCAGAG 


GATGGCACCA 


TAGTGGATGA 


TGACGATTAC 


251 


TTTCTGTGTC 


TACCTTCCAA 


TACTAAGTTT 


GTGGCATTGG 


CTAGTAATGA 


301 


GAAATGGGCA 


TACAACAATT 


CAGATGGAGG 


TACAGCTTGG 


ATTTCCCAAG 


351 


AGTCCTTTGA 


TGTAGATGAA 


ACAGACAGCG 


GGGCAGGGTT 


GAAGTGGAAG 


401 


AATGTGGCCA 


GGCAGCTGAA 


AGAAGATCTG 


TCCAGCATCA 


TCCTCCTATC 


451 


AGAGGAGGAC 


CTCCAGATGC 


TTGTTGACGC 


TCCCTGCTCA 


GACCTGGCTC 


501 


AGGAACTACG 


TCAGAGTTGT 


GCCACCGTCC 


AGCGGCTGCA 


GCACACACTC 


551 


CAACAGGTGC 


TTGACCAAAG 


AGAGGAAGTG 


CGTCAGTCCA 


AGCAGCTCCT 


601 


GCAGCTGTAC 


CTCCAGGCTT 


TGGAGAAAGA 


GGGCAGCCTC 


TTGTCAAAGC 


651 


AGGAAGAGTC 


CAAAGCTGCC 


TTTGGTGAGG 


AGGTGGATGC 


AGTAGACACG 


701 


GGTATCAGCA 


GAGAGACCTC 


CTCGGACGTT 


GCGCTGGCGA 


GCCACATCCT 


751 


TACTGCACTG 


AGGGAGAAGC 


AGGCTCCAGA 


GCTGAGCTTA 


TCTAGTCAGG 


801 


ATTTGGAGGT 


GGGCGGAAAC 


CAGGGTCACT 


GAGCTACAGA 


GGAGGACATG 


851 


CCCTGGGATG 


TAGTAGTATC 


ATGCAGAGGT 


GTGTGGGCCC 


TTTTGTTCAC 


901 


CTCTGCAGAC 


TGTGAATCCT 


AGCTGCCAGT 


TTGCCTATTA 


TATGCCAAGG 


951 


CATTTGCAAA 


AATCTCATTA 


ATCTAAATCA 


AAATAGCTTT 


AAAGAAAAAT 


1001 


GCAAAAAAAA 


AAAAAAAAAA 


AAAAAAA 
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MEVTGDAGVP ESGEIRTLKP CLLRRNYSRE QHGVMSCLE DLRSKACDIL 

51 AIDKSLTPVT LVLAEDGTIV DDDDYFLCLP SNTKFVALAS NEKWAYNNSD 
101 GGTAWISQES FDVDETDSGA GLKWKNVARQ LKEDLSSIIL LSEEDLQMLV 
151 DAPCSDLAQE LRQSCATVQR LQHTLQQVLD QREEVRQSKQ LLQLYLQALE 
201 KEGSLLSKQE ESKAAFGEEV DAVDTGISRE TSSDVALASH ILTALREKQA 
251 PELSLSSQDL EVGGNQGH 



FIG. 9 
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1 GGCACGAGGC TAAATGTAGA CAATGGTTAG AGAAGAATTT TCCAAATGAA 



51 


TTTGCAAAAC 


TTACTGTAGA 


AAATTCACCC 


AAACAAGAAG 


CTGGAATTAG 


101 


TGAGGGTCAA 


GGAACAGCAG 


GGGAAGAAGA 


GGAGAAGAAA 


AAACAGAAGA 


151 


GAGGTGGAAG 


GGGTCAAATA 


AAACAAAAAA 


AGAAGACCGT 


ACCACAAAAG 


201 


GTTACTATAG 


CCAAAATTCC 


CAGAGCAAAG 


AAGAAATATG 


TGACAAGAGT 


251 


ATGTGGCCTT 


GCAACTTTTG 


AAATTGATCT 


TAAAGAAGCA 


CAAAGATTTT 


301 


TTGCTCAAAA 


ATTCTCCTGT 


GGTGCCTCAG 


TAACAGGGGA 


GGATGAAATT 


351 


ATCATTCAGG 


GAGATTTTAC 


AGATGACATA 


ATTGATGTCA 


TTCAGGAAAA 


401 


ATGGCCAGAG 


GTAGATGATG 


ACAGCATCGA 


AGATCTTGGA 


GAAGTAAAGA 


451 


AGTGAATTTG 


AAAATTTGTC 


TGTATTTAAT 


GGCCTGAACT 


GAGAGTTGAT 


501 


ATGGCCAAAG 


GGAGAGAGGC 


CTTTTAAAAT 


ATATATATAT 


ATACACATAT 


551 


ATATGTATAT 


ATACACATAT 


ATGTATGTAT 


ACACATATAC 


ACATGTATAT 


601 


ATACATGTGT 


GTATGTATGC 


ATGTATATAC 


ATATATACAT 


ACACATATAT 


651 


GTATACATAT 


ATACACATAT 


ATGTATACAT 


ATATACACAT 


ATATGTATAC 


701 


ATATATATAT 


ATTCTACAGT 


AAAACTGTAG 


ACTGTCCTCG 


TCCTTGGCAT 


751 


TTTCACTGTT 


CTGTACAAGG 


CTGCTTGTTT 


TTTTATTGCC 


AAAGTCAAAT 


. 801 


AAACGGGAGA 


CTGTCATGCT 


CATGCATGAA 


TAGAATTTAG 


TCAAATAAAA 


851 


AATTTTGGTC 


ATTTGGTACT 


GACTTTCTCT 


CTCTCTCTCT 


CTCTCTTTTT 


901 


TTTTTTTGAC 


AGAGTCTCGC 


ACTGTTGCCT 


GGGCTGGAAT 


GCAGTGGTGC 


951 


GATCTCGGCT 


CACTGCAACC 


TCCGCCTCCC 


GGGTTCAAGT 


GATTCTCCTG 


1001 


CCTCAGCCTC 


CCAAGTAGGT 


GGGATTACAG 


GCGCCCGCCA 


CCACGCCCAG 


1051 


CTAATTTTTG 


TATTTTTAGT 


AGAGATGGGG 


TTTCACTATG 


TTGGCCAGGC 


110.1 . 


TGGTCTCGAA 


CTCCTGACCT 


CGTGATTGGC 


CCACCTCAGC 


CTCCCAAAGT 


1151 


GCTGGTATTA 


CAGATGTGAG 


CCACCGCACC 


CAGCCTGAGT 


TTCTCTTTCT 


1201 


• CTCTTTTTAA 


CTTTATTTTT 


TGAAAAACCC 


GGTAGACTTT 


GTGGGGAGCA 


1251 


TTTTTGTTGA 


TAATTTTACT 


GATCTAAAGC 


TGAGTGATTT 


TTTAAAAGAA 


1301 


TTTGAATTTG 


GCTTCCTCAC 


CAGTAATATG 


TCTCCTTGCT 


TCTTTGATGT 


1351 


GATAGTTTTG 


AGATGGGTGA 


GAATCTAATA 


GATCTGTGGT 


TGAATTTGCT 


1401 


GTGTTGTTAT 


GAAGTCCACC 


CTGTGGGCAC 


AATAACATAA 


CTGTTGGTAG 


1451 


GAGTTGTTTG 


AGCTATTCTG 


GAGATTATTT 


GGTAAAGTAT 


ACTAAAAGCC 


1501 


TTAAAACCAT 


GTATGTGCGC 


TGTTTGAACC 


AGTAAGCCAC 


TTCTTTGACA 


1551 


TTAGAAGACA 


TTAGAAGAAA 


TAATCAGCCT 


TGCATAAAAC 


TTATGGATGA 
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1 fifll 

IDUx 


AAfyFATTPAT 


PAPAATATTA 


1 

10 Ji. 


THAAPAATTH 


PPAAATPPTT 

VJVJrtrtrV l \J\J L i 


1 /UI 


1A1 1A10LA1 


Albl 1 iAAAb 


1751 


TGTTTGGTAA 


TGTCAAGTGG 


1801 


GTCATCATCT 


GCTCTGAGTG 


1851 


ATTCTGGCTA 


TAAATTATGT. 


1901 


CTCTACACCC 


ATGCTCAATA 


1951 


TCTTCTCACA 


AGAGAAAATG 


2001 


AACTTTAAGT 


CAGGTGCTGC 


2051 


GTTGCTGTGG 


ACACTTTTAA 


2101 


TCGCCATGCA 


AATTACAATC 


2151 


TTAGAAAAAT 


GTGTAGTAAA 


2201 


AAAAAAAAAA 


AAAA 



TTTATAATAA AAAATTGCAA ATGTTATAAA 
AAAGAAGTGA TGGTGCATTG TGTGGTAGAA 
AATCATATTT TCTAAGATTA TTTGGAAGCA 
AGTACCCCAG ATACATTTTA GACATTTATC 
GAAGGCCGTT CAGAGAGGCT AGAGGTTCTT 
GAGTAAAATT GTGCTAACCA GTTAAAAGTA 
TATAGTCCTG GAAATAGCAA TTGAAACATG 
ACAGTTTTAA TGATGTATTT GATGAATTTA 
AAATTGGAAA GAAGACTTGT GGTGTTTTAA 
GAAACTTAGA ACCCATGGAA CCCTTGTTTA 
TTGAATGAGT GTTTTTTTAA AAATAAAGTA 
GATGTAAAAT TAAAAAAAAA AAAAAAAAAA 
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1 . HEAKCRQWLE KNFPNEFAKL TVENSPKQEA GISEQQ6TAG EEEEKKKQKR 
51 GGRGQIKQKK KTVPQKVTIA KIPRAKKKYV TRVCGLATFE IDLKEAQRFF 
101 AQKFSCGASV TGEDEIIIQG DFTDDIIDVI QEKWPEVDDD SIEDLGEVKK 



FIG. 11 



SUBSTITUTE SHEET (RULE 26) 



WO 01/25250 



BEST AVAILABLE COPY 

PCT/US00/27649 



C H 




16/21 

C H 




C H 




H31 
C H 



1.8- 



kb : .^.^ 




H45 



C H 




H63 





FIG. 12 



SUBSTITUTE SHEET (RULE 26) 



BEST AVAILABLE COPY 

WO 01/25250 PCT/US00/27649 

17/21 



CONTROL C40 H13 H17 H41 H37 
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SEQUENCE LISTING 



<110> Dennis J. Slamon 
Juliana J. Oh 

<120> DIFFERENTIALLY EXPRESSED GENES 

ASSOCIATED WITH HER-2/NEU OVE REX PRESS ION 



<130> 30448, 79USU1 

<150> US 60/157,923 
<151> 199S-10-06 

<160> 14 

<170> FastSEQ for Windows Version 4.0 

<210> 1 

<211> 1981 

<212> DNA 

<213> Homo Sapiens 

<220> 
<221> CDS 

<222> (66) . . . (1526) 



<400> 1 

ggcacgagct gcgataatag cgaggcagca gtgcagcttt cagagggtcc gggctcagag 
gggct atg att egg agg gtt ctg ccg cac ggc atg ggc egg ggc etc ttg 
Met He Arg Arg Val Leu Pro His Gly Met Gly Arg Gly Leu Leu 
1 5 10 15 

acc egg agg cca ggc acg cgc aga gga ggc ttt tct ctg gac tgg gat 
Thr Arg Arg Pro Gly Thr Arg Arg Gly Gly Phe Ser Leu Asp Trp Asp 
20 25 30 

gga aag gtg tct gag att aag aag aag ate aag teg ate ctg cct gga 
Gly Lys Val Ser Glu He Lys Lys Lys He Lys Ser He Leu Pro Gly 
35 40 . 45 

agg tec tgt gat eta ctg caa gac acc age cac ctg cct ccc gag cac 
Arg Ser Cys Asp Leu Leu Gin Asp Thr Ser His Leu Pro Pro Glu His 
50 55 60 

teg gat gtg gtg ate gtg gga ggt ggg gtg ctt ggc ttg tct gtg gee 
Ser Asp Val Val He Val Gly Gly Gly Val Leu Gly Leu Ser Val Ala 
65 70 75 

tat tgg ctg aag aag ctg gag age aga cga ggt get att cga gtg eta 
Tyr Trp Leu Lys Lys Leu Glu Ser Arg Arg Gly Ala He Arg Val Leu 
80 85 90 95 

gtg gtg gaa egg gac cac acg tat tea cag gee tec acc ggg etc tea 
Val Val Glu Arg Asp His Thr Tyr Ser Gin Ala Ser Thr Gly Leu Ser 
100 105 HO 

gta ggt ggg att tgt cag cag ttc tea ttg cct gag aac ate cag etc 
Val Gly Gly He Cys Gin Gin Phe Ser Leu Pro Glu Asn lie Gin Leu 
115 120 125 

tec etc ttt tea gee age ttt eta egg aac ate aat gag tac ctg gee 
Ser Leu Phe Ser Ala Ser Phe Leu Arg Asn He Asn Glu Tyr Leu Ala 
130 135 140 

gta gtc gat get cct ccc ctg gac etc egg ttc aac ccc teg ggc tac 
Val Val Asp Ala Pro Pro Leu Asp Leu Arg Phe Asn Pro Ser Gly Tyr 



60 
110 



158 



206 



254 



302 



350 



398 



446 



494 



542 
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2 

.145 1 150 155 

etc ttg ctg get tea gaa aag gat get gca gee atg gag age aac gtg 590 
Leu Leu Leu Ala Ser Glu Lys Asp Ala Ala Ala Met Glu Ser Asn Val 
160 165 170 175 

aaa gtg cag agg cag gag gga gec aaa gtt tct ctg atg tct cct gat 638 
Lys Val Gin Arg Gin Glu Gly Ala Lys Val Ser Leu Met Ser Pro Asp 
180 185 190 

cag ctt egg aac aag ttt ccc tgg ata aac aca gag gga gtg get ttg 686 
Gin Leu Arg Asn Lys Phe Pro Trp lie Asn Thr Glu Gly Val Ala Leu 
195 200 205 

gcg tct tat ggg atg gag gac gaa ggt tgg ttt gac ccc tgg tgt ctg 734 
Ala Ser Tyr Gly Met Glu Asp Glu Gly Trp Phe Asp Pro Trp Cys Leu 
210 215 220 

etc cag ggg ctt egg cga aag gtc cag tec ttg gga gtc ctt ttc tgc 782 
Leu Gin Gly Leu Arg Arg Lys Val Gin Ser Leu Gly Val Leu Phe Cys 
225 230 235 

cag gga gag gtg aca cgt ttt gtc tct tea tct caa cgc atg ttg acc 830 
Gin Gly Glu Val Thr Arg Phe Val Ser Ser Ser Gin Arg Met Leu Thr 
240 . 245 250 255 

aca gat gac aaa gcg gtg gtc ttg aaa agg ate cat gaa gtc cat gtg 878 
Thr Asp Asp Lys Ala Val Val Leu Lys Arg lie His Glu Val His Val 
260 265 . 270 



aag atg gac cgc age ctg gag tac cag cct gtg gaa tgc gee att gtg 
Lys Met Asp Arg Ser Leu Glu Tyr Gin Pro Val Glu Cys Ala He Val 
275 280 285 . 



926 



ate aac gca gee gga gee tgg tct gcg caa ate gca gca ctg get ggt 974 

He Asn Ala Ala Gly Ala Trp Ser Ala Gin He Ala Ala Leu Ala Gly 

290 295 300 

gtt gga gag ggg ccg cct ggc acc ctg cag ggc acc aag eta cct gtg 1022 

Val Gly Glu Gly Pro Pro Gly Thr Leu Gin Gly Thr Lys Leu Pro Val 
305 310 315 

gag ccg agg aaa agg tat gtg tat gtg tgg cac tgc ccc cag gga cca 1070 

Glu Pro Arg Lys Arg Tyr Val Tyr Val Trp His Cys Pro Gin Gly Pro 
320 325 330 335 

ggc eta gag act ccg ctt gtt gca gac acc agt gga gee tat ttt cgc 1118 

Gly Leu Glu Thr Pro Leu Val Ala Asp Thr Ser Gly Ala Tyr -Phe Arg 
340 345 350 

egg gaa gga tta ggt age aac tac eta ggt ggt cgt age ccc act gag 1166 

Arg Glu Gly Leu Gly Ser Asn Tyr Leu Gly Gly Arg Ser Pro Thr Glu 
355 360 365 

cag gaa gaa ccg gac ccg gcg aac ctg gaa gtg gac cat gat ttc ttc 1214 

Gin Glu Glu Pro Asp Pro Ala Asn Leu Glu Val Asp His Asp Phe Phe 

370 375 380 

cag gac aag gtg tgg ccc cat ttg gee ctg agg gtc cca get ttt gag 1262 

Gin Asp Lys Val Trp Pro His Leu Ala Leu Arg Val Pro Ala Phe Glu 
385 390 395 

act ctg aag gtt cag age gec tgg gec ggc tat tac gac tac aac acc 1310 

Thr Leu Lys Val Gin Ser Ala Trp Ala Gly Tyr Tyr Asp Tyr Asn Thr 
400 405 410 415 

ttt gac cag aat ggc gtg gtg ggc ccc cac ccg eta gtt gtc aac atg 1358 

Phe Asp Gin Asn Gly Val Val Gly Pro His Pro Leu Val Val Asn Met 
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tac ttt get act ggc ttc agt ggt cac ggg etc cag cag gec cct ggc 1406 
Tyr Phe Ala Thr Gly Phe Ser Gly His Gly Leu Gin Gin Ala Pro Gly 
435 440 445 

att ggg cga get gta gca gag atg gta ctg aag ggc agg ttc cag ace 1454 
He Gly Arg Ala Val Ala Glu Met Val Leu Lys Gly Arg Phe Gin Thr 
450 455 460 

ate gac ctg age ccc ttc etc ttt acc cgc ttt tac ttg gga gag aag 1502 
He Asp Leu Ser Pro Phe Leu Phe Thr Arg Phe Tyr Leu Gly Glu Lys 
465 470 475 



ate cag gag aac aac ate ate tga gcatgtgtgc tetgeactgg ctccactggc 1556 
He Gin Glu Asn Asn He He * 
480 485 

ttgcatcctg gctgtgttca cagccttgtt tgctgcttcc atcttcccca gtactgtgcc 1616 

aggccttctc cccctcccca gtgtcetctc ctctcaggca ggecattgea cccatatggc 1676 

tgggcaggca caggcagtga ggccgaggcc aatagcgagt gatgageggg atcctaggac 17 36 

tgatctgtag cccatgctga tgtcacccac cagggcaatc catctggagg cctgagcacc 1796 

ctggcccagg actggcttca tcctggcact gaccaggaaa gactgcctct gaccctctta 1856 

gcagacagag cccaggcatg ggagcactct ggggcagect ggctcaggtt tattgatttt 1916 

cgtctgttta ccctatccat taatcaatac atgtaattaa ctcctaaaaa aaaaaaaaaa 1976 

aaaaa 1981 



<210> 2 

<211> 486 

<212> PRT 

<213> Homo Sapiens 



<400> 2 



Met 


lie 


Arg Arg 


Val 


Leu 


Pro 


His 


Gly 


Met 


Gly 


Arg 


Gly 


Leu 


Leu 


Thr 


1 








5 










10 










15 




Arg 


Arg 


Pro Gly 


Thr 


Arg 


Arg 


Gly 


Gly 


Phe 


Ser 


Leu 


Asp 


Trp 


Asp 


Gly 








20 










25 










30 






Lys 


Val 


Ser 


Glu 


He 


Lys 


Lys 


Lys 


He 


Lys 


Ser 


He 


Leu 


Pro 


Gly 


Arg 






35 










40 










45 








Ser 


Cys 


Asp 


Leu 


Leu 


Gin 


Asp 


Thr 


Ser 


His 


Leu 


Pro 


Pro 


Glu 


His 


Ser 




50 










55 










60 










Asp 


Val 


Val 


He 


Val 


Gly 


Gly 


Gly 


Val 


Leu 


Gly 


Leu 


Ser 


Val 


Ala 


Tyr 


65 










70 










75 










80 


Trp 


Leu 


Lys 


Lys 


Leu 


Glu 


Ser 


Arg 


Arg 


Gly 


Ala 


He 


Arg 


Val 


Leu 


Val 










85 










90 










95 




Val 


Glu 


Arg 


Asp 


His 


Thr 


Tyr 


Ser 


Gin 


Ala 


Ser 


Thr 


Gly 


Leu 


Ser 


Val 








100 










105 










110 






Gly 


Gly 


He 


Cys 


Gin 


Gin 


Phe 


Ser 


Leu 


Pro 


Glu 


Asn 


He 


Gin 


Leu 


Ser 






115 










120 










125 








Leu 


Phe 


Ser 


Ala 


Ser 


Phe 


Leu 


Arg 


Asn 


He 


Asn 


Glu 


Tyr 


Leu 


Ala 


Val 




130 










135 










140 










Val 


Asp 


Ala 


Pro 


Pro 


Leu 


Asp 


Leu 


Arg 


Phe 


Asn 


Pro 


Ser 


Gly 


Tyr 


Leu 


145 










150 










155 










160 


Leu 


Leu 


Ala 


Ser 


Glu 


Lys 


Asp 


Ala 


Ala 


Ala 


Met 


Glu 


Ser 


Asn 


Val 


Lys 










165 










170 










175 


val 


Gin 


Arg Gin 


Glu 


Gly 


Ala 


Lys 


Val 


Ser 


Leu 


Met 


Ser 


Pro 


Asp 


Gin 








180 










185 










190 






Leu 


Arg 


Asn 


Lys 


Phe 


Pro 


Trp 


He 


Asn 


Thr 


Glu 


Gly 


Val 


Ala 


Leu 


Ala 






195 










200 










205 








Ser 


Tyr 


Gly Met 


Glu 


Asp 


Glu 


Gly 


Trp 


Phe 


Asp 


Pro 


Trp 


Cys 


Leu 


Leu 




210 










215 










220 










Gin 


Gly 


Leu Arg 


Arg 


Lys 


Val 


Gin 


Ser 


Leu 


Gly 


Val 


Leu 


Phe 


Cys 


Gin 


225 










230 










235 










240 


Gly 


Glu 


Val 


Thr 


Arg 


Phe 


Val 


Ser 


Ser 


Ser 


Gin 


Arg 


Met 


Leu 


Thr 


Thr 










245 










250 










255 




Asp 


Asp 


Lys Ala 


Val 


Val 


Leu 


Lys 


Arg 


He 


His 


Glu 


Val 


His 


Val 


. Lys 








260 










265 










270 






Met 


Asp 


Arg 


Ser 


Leu 


Glu 


Tyr 


Gin 


Pro 


Val 


Glu 


Cys 


Ala 


He 


Val 


He 
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275 






280 




Asn 


Ala 


Ala 


Gly Ala Trp Ser 


Ala 


Gin lie 




290 






295 






Glv 


Glu 


Gly 


Pro Pro Gly Thr Leu' Gin Gly 


305 








310 






Pro 


Arg 


Lys 


Arg Tyr Val Tyr 


Val 


Trp His 








325 






330 


Leu 
eu 


Glu 


Thr 


Pro Leu 


Val Ala 


Asp 


Thr Ser 








340 






345 


Glu 


Glv 


Leu 


Gly Ser Asn Tyr 


Leu 


Gly Gly 






355 






360 




Glu 


Glu 


Pro 


Asp Pro 


Ala Asn 


Leu 


Glu Val 




370 






375 






Asp 


Lys 


Val 


Trp Pro 


His Leu 


Ala 


Leu Arg 


385 








390 






Leu 


Lys 


Val 


Gin Ser 


Ala Trp 


Ala 


Gly Tyr 








405 






410 


Asp 


Gin 


Asn 


Gly Val 


Val Gly 


Pro 


His Pro 








4 20 






425 


Phe 


Ala 


Thr 


Gly Phe 


Ser Gly 


His 


Gly Leu 






435 






440 




Gly 


Arg 


Ala 


Val Ala 


Glu Met 


Val 


Leu Lys 




450 






4 55 






Asp 


Leu 


Ser 


Pro Phe 


Leu Phe 


Thr 


Arg Phe 


465 








470 






Gin 


Glu 


Asn 


Asn lie 


lie 







485 

<210> 3 

<211> 1750 

<212> DNA 

<213> Homo Sapiens 



PCT/USOO/27649 

285 



Ala 


Ala Leu Ala Gly 


Val 




300 






Thr 


Lys Leu Pro 


Val 


Glu 


315 




320 


Cvs 


Pro Gin Gly 


Pro 


Glv 






335 




Glv 


Ala Tyr Phe 


Arg 


Arg 




350 






Arg 


Ser Pro Thr 


Glu 


Gin 




365 






Asd 


His Asp Phe 


Phe 


Gin 




380 






Val 


Pro Ala Phe 


Glu 


Thr 


395 






400 


Tyr 


Asp Tyr Asn Thr 


Phe 






415 




Leu 


Val Val Asn 


Met 


Tyr 




430 






Gin 


Gin Ala Pro Gly 


He 




445 






Gly 


Arg Phe Gin 


Thr 


He 




4 60 






Tyr 


Leu Gly Glu 


Lys 


He 


475 






480 



<400> 3 

ggcacgagcg gggacggagc gagccggcgc cagggcccct cgggccggga agaggggaag 60 

gggagcgagg ctgatgcccg gcggaggggc gagcgcggcg tctggccggc ttctcaccgc 120 

cgcggagcaa agagggtccc gggaagcggc agggtcggcg tccaggagcg gcttcggggg 180 

ctccggcggc ggcagaggcg gagcaagcgg ccccgggtcc gggagcggag gccctggggg 240 

ccccgcgggc aggatgagct tgaccccgaa ggagctctcq agcctgctga gcatcatatc 300 

ggaggaggcg ggcggcggca gcaccttcga gggcctgtcc accgccttcc accactactt 360 

cagcaaggcc gaccacttcc gcctgggctc ggtgctcgtc atgctgctcc agcagcccga 420 

cctgctgcct agcgcggcgc agcgcctcac ggcgctctac ctgctctggg agatgtaccg 480 

caccgagccg ctggccgcca accccttcgc cgccagcttc gcgcacctgc tcaaccccgc 540 

gccgcccgcc cgcggcggcc aggaacccga ccgccctccg ctctcaggat ttttacctcc 600 

tataactcca ccagaaaagt tttttctttc ccagctgatg ctggcacccc cacgggaact 660 

cttcaaaaag acgcctcgcc agattgcact gatggacgtt ggaaacatgg gccagtctgt 720 

ggacattagt gggcttcagt tagccttggc cgaacgccaa tctgaattgc caacgcaaag 780 

caaagcgagc ttccccagta ttctcagtga cccagacccg gattcttcta attctggatt 840 

tgacagctca gttgcctctc agatcacaga agctttagtc agcggaccaa agccacctat 900 

tgaaagccat tttcgaccag agtttattcg tccaccgcct ccactccaca tttgtgagga 960 

tgaacttgct tggctaaacc ccacggagcc tgaccacgcg atccagtggg ataaatcgat 1020 

gtgtgttaag aatagcactg gtgtggagat caaacgaata atggccaaag ccttcaaaag 1080 

ccccttatcc tctccccaac aaacacagct acttggtgag ttggaaaaag accccaaact 1140 

tgtctaccat attggcctca ccccagccaa acttcctgac cttgtggaaa acaacccttt 1200 

agtcgctata gaaatgttgc tgaaattaat gcagtcaagc cagatcactg agtatttctc 1260 

tgtcctggtc aatatggaca tgtctttaca ttcaatggaa gttgtaaatc gactaactac 1320 

agctgttgat ctacctcctg aatttattca cctttatata tcaaattgca tctctacttg 1380 

tgaacagatt aaggataaat atatgcagaa tcggttggtg cgtcttgtgt gtgtgtttct 1440 

ccaatccttg atccgtaaca aaattattaa tgtacaggat ttgtttatag aagtgcaggc 1500 

attctgtatt gaattcagta ggatacgaga agctgctggt cttttccggt tgttgaagac 1560 

attggatact ggggaaacac cttctgagac caaaatgtca aaataatacc tcatcagaac 1620 

catcccatcc attcactgtt cagctgtact gtgatttagt ttttacaccg ttaaaaccct 1680 

gagtggattg cttggtttaa tgcatataaa cagtacttta tctacttaaa gcaaaaaaaa 1740 

aaaaaaaaaa 1750 

<210> 4 
<211> 510 
<212> PRT 
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<213> Homo Sapiens 



<400> 4 



Met 


Pro 


Gly 


Gly 


Gly Ala 


Ser 


Ala 


Ala Ser Gly Arg 


Leu 


Leu 


Thr 


Ala 


1 








5 










10 








15 




Ala 


Glu 


Gin 


Arg 


Gly Ser Arg 


Glu 


Ala Ala Gly Ser 


Ala 


Ser Arg 


Ser 








20 










25 








30 






Gly 


Phe 


Gly 


Gly 


Ser Gly Gly Gly 


Arg Gly Gly Ala 


Ser Gly Pro Gly 






35 










40 








45 








Ser 


Gly 


Ser 


Gly 


Gly Pro 


Gly Gly 


Pro Ala Gly Arg 


Met 


Ser 


Leu 


Thr 




50 










55 








60 










Pro 


Lys 


Glu 


Leu 


Ser 


Ser 


Leu 


Leu 


Ser 


He 


He Ser 


Glu 


Glu 


Ala 


Gly 


65 








70 










75 








80 


Glv 


Gly 


Ser 


Thr 


Phe 


Glu Gly 


Leu 


Ser 


Thr 


Ala Phe 


His 


His 


Tyr 


Phe 






85 










90 








95" 




Ser 


Lys 


Ala 


Asp 
100 


His 


Phe 


Arg 


Leu 


Gly 
105 


Ser 


Val Leu 


Val 


Met 
110 


Leu 


Leu 


Gin 


Gin 


Pro 


Aso 


Leu 


Leu 


Pro 


qc, r 
Oc £ 


Ala Ala Gin Arg 


Leu 


Thr 


Ala 


Leu 






115 










120 








125 










Leu 


Leu 


TrD 




Met 


Tyr 


Arg 


Thr 


Glu 


Pro Leu 


Ala 


Ala 


Asn 


Pro 


130 










135 








140 










Phe 


Ala 


Ala 


Ser 


Phe 


Ala 


His 


Leu 


Leu 


Asn 


Pro Ala 




Pro Ala 


Arg 


145 










150 










155 








160 


Gly 


Glv 


Gin 


Glu 


trro 


Asp Arg 


Pro 


Pro 


Leu 


Ser Gly 


Phe 


Leu 


Pro 


Pro 










165 










170 








17 5 




lie 


Thr 


Pro 


Pro 
180 


Glu 


Lys 




Phe 


Leu 
185 


Ser 


Gin Leu 


Met 


Leu 
190 


Ala 


Pro 


Pro 


Arg 


Glu 


Leu 


Php 


Lys 


T \j<: 
L»y 5 


Thr 


Pro 


Arg 


Gin He 


Ala 


Leu Met Asp 






195 










200 








205 








Val 


Gly 
210 


Asn 


Met 


y 


bin 


oer 
215 


Val 


Asp 


He 


Ser Gly 
220 


Leu 


Gin 


Leu 


Ala 


Leu 


Ala 


Glu 


Arg 


Gin 


Ser 


r* i .« 
blU 


Leu 


Pro 


Thr 


Gin Ser 


Lys 


Ala 


Ser 


Phe 


225 










230 










235 








240 


Pro 


Ser 


He 


Leu 


Otr L 


Asp 


Pro 


Asp 


Pro Asp 


Ser Ser 


Asn 


Ser Gly Phe 




















250 








255 




Asp 


Ser 


Ser 


Val 


Ala 


Ser 


Gin 


He 


Thr 


Glu 


Ala Leu 


Val 


Ser Gly 


Pro 






260 










265 








270 






Lys 


Pro 


Pro 


He 


Glu 


Ser 


His 


Phe 


Arg 


Pro 


Glu Phe 


He 


Arg 


Pro 


Pro 




275 










280 








285 








Pro 


Pro 
290 


Leu 


His 


He 


Cys 


Glu 
295 


Asp 


Glu 


Leu 


Ala Trp 
300 


Leu 


Asn 


Pro 


Thr 


Glu 


Pro 


Asp 


His 


Ala 


He 


Gin 


TrD 


Asp 


Lys 


Ser Met 


Cys 


Val 


Lys Asn 


305 










310 










315 








320 


Ser 


Thr 


G:.y 


Val 


Glu 
325 


He 


Lys 


Arg 


He 


Met 
330 


Ala Lys 


Ala 


Phe 


Lys 
335 


Ser 


Pro 


Leu 


Ser 


Ser 
340 


Pro 


Gin 


Gin 


Thr 


Gin 
345 


Leu 


Leu Gly 


Glu 


Leu 
350 


Glu 


Lys 


Asp 


Pro 


Lvs 


Leu 


Val 


Tyr 


His 


He 


Gly Leu Thr Pro 


Ala 


Lys 


Leu 


Pro 






355 










360 








365 








Asp 


Leu 


Val 


Glu 


Asn 


Asn 


Pro 


Leu 


Val 


Ala 


He Glu 


Met 


Leu 


Leu 


Lys 


370 










375 








380 










Leu 


Met 


Gin 


Ser 


Ser 


Gin 


He 


Thr 


Glu Tyr 


Phe Ser 


Val 


Leu 


Val 


Asn 


385 










390 










395 








400 


Met 


Asp 


Met 


Ser 


Leu 


His 


Ser 


Met 


Glu 


Val 


Val Asn 


Arg 


Leu 


Thr 


Thr 








405 










410 








415 




Ala 


Val 


Asp 


Leu 


Pro 


Pro 


Glu 


Phe 


He 


His 


Leu Tyr 


He 


Ser 


Asn Cys 






420 










425 








430 






lie 


Ser 


Thr 


Cys 


Glu 


Gin 


He 


Lys 


Asp 


Lys 


Tyr Met 


Gin Asn Arg Leu 






435 










440 








445 








Val 


Arg 
450 


Leu 


Val 


Cys 


Val 


Phe 
455 


Leu 


Gin 


Ser 


Leu He 
460 


Arg 


Asn 


Lys 


He 


He 


Asn 


Val 


Gin 


Asp 


Leu 


Phe 


He 


Glu 


Val 


Gin Ala 


Phe 


Cys 


He 


Glu 


465 










470 










475 








480 


Phe 


Ser 


Arg 


He 


Arg 


Glu 


Ala 


Ala 


Gly Leu 


Phe Arg 


Leu 


Leu 


Lys 


Thr 










485 










490 








495 




Leu 


Asp 


Thr 


Gly 
500 


Glu 


Thr 


Pro 


Ser 


Glu 
505 


Thr 


Lys Met 


Ser 


Lys 
510 
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<210> 5 

<211> 3346 

<212> DNA 

<213> Homo Sapiens 

<400> 5 

ggcacgagct ggctcgcgcg tgccttttcc cctcaggttg tggggagagc ggaatcctgc 60 

tccgccgtcg cagcagcagc ggcagccccg gcagcctcgg gcgacagcgg cggcgcgcga 120 

gcccccgggc ggaccgtacc accgctcgcc agcacgcagg gggagccgcc cgtctcgccg 180 

cgcacgcctc ggcgaccccg cggggctgag gcgtcgccgc gcccggcagc gtgagcgcag 240 

agccggcctc gaccccgagc tcggagccce gcgggccgcg cccgccgccg gccccaccca 300 

tccgggtcga ggaggccgag gccatggctg agacggagga gcggagcctg gacaacttct 360 

ttgccaagag ggacaagaag aagaagaagg agcggagcaa ccgggcggcg agtgccgcgg 420 

gcgcagcggg cagcgccggc ggaagcagtg gagccgcggg tgcggcgggc ggcggggcgg 480 

gcgcggggac ccggccgggt gacggcggga ccgccagcgc gggggctgcg ggcccagggg 540 

ccgccaccaa ggctgtgacg aaggacgaag atgaatggaa agaattggag caaaaagagg 600 

ttgattacag cggcctcagg gttcaggcaa tgcaaataag cagtgaaaag gaagaagacg 660 

ataatgaaaa gagacaagat ccaggtgata actgggaaga aggtggaggt ggtggtggag 720 

gtatggaaaa atcttcaggt ccctggaata aaacagctcc agtacaagca cctcctgctc 780 

cagtaattgt tacagaaacc ccagaaccag cgatgactag tggtgtgtat aggcctcctg 840 

gggccaggtt aaccacaaca aggaaaacac cacaaggacc accagaaatc tacagtgata 900 

cacagttccc atccctgcag tcaactgcca agcatgtaga aagccggaag gataaagaaa 960 

tggagaagag ctttgaagta gtaagacaca aaaatagagg tagggatgag gtttcaaaaa 1020 

accaggccct taaacttcag ctagacaacc aatatgctgt gcttgaaaat cagaaaagca 1080 

gccactcaca atacaattaa ggaatgggct ttgctaaccc ttctgaggta actagactgc 1140 

agctaaccac caccaacagc cattcatcat ctgatctctg ctggatctac agacaccgat 1200 

gcagaccact cgatttcatg accggcccta ttgcactatg gaagttaaag tgtcacgact 1260 

gctctatgca tattggattt aggggaattt tcattgttac ataaatgtgt gaactagttt 1320 

caacagtgtt ctttcatatt tactctgcaa atacaaaaaa ccaaaacctg cagccagtgg 1380 

tcatttcaaa atctttttat gttcagatac tgagccttca taagggttga ctacctcaga 1440 

tttgctgcac tcattgtgga cttcatgtgg atcacaactt ctggataaga agattacaac 1500 

tattaagtgt cgatgtgaac cttgcaacca gctctactgg attcttatca gaaatcctgc 1560 

ataaaaagtc jgccatctgg gttctgatct gctgtaaaag atgaagattt aagtgacctt 1620 

aattaacctg tcctgtgccc tacccttaag gaatactctc tgtagtaggc tgttgttata 1680 

ttagacttcc tggaacacac cgctgaaaag aactgatgtg ttcagatcat ctgtgtaggg 1740 

ctgtgatttg taatttaaac taattgtatt ctgaggtaac cacaaaataa attcaaccaa 1800 

actggggtcc accaagtggg ggaaggggaa gggagagaat aatcttgggg gttttttttt 1860 

ttggtaattt ttttatttgg atagtgcttt tttgttttgt ttttgttctg cattaaggcc 1920 

ttttttgctt tgacttgaaa taagttcttt gacagagcat attgcttggt taattaagta 1980 

acctaaagta tgcattagga ttgtgaaatg tctcgtgagt atgccaatcc tgagggtgga 2040 

accaaatagc ctttgatgaa aagggcagtg gattctggag gctctacttc aggtgctgct 2100 

ataatgcctc atctaatcag gactaaattg tgtaggaaac tgcagtggga agaatatgct 2160 

ttctgctcag gctaagaggg tcactgatct gtccttagaa attcagagta acatgagcaa 2220 

aacctcagct aaaacccatt taagtggcat ggattgtgca tgatctttga taagaattcc 2280 

tcatgtactt gtgcctagtt tttcaaggta ttggctgttc tatagatgca gtgattgtcc 2340 

cagctagctc tgttaccagc cttttggtgt gtctttatgt tcatttggag agtcagggcg 2400 

aaagacaggt gatgtagcac ttctgttttt aataattatt gcttaaaata cctattaata 2460 

gttttgggtc atttaaaggg acttgaggaa gctacccagg attacagaag agtgtccacc 2520 

taacaagatg gtctggcagt ttcctagttt tgtatctggt tcaatagaaa tatgtgaaag 2580 

tggtaatgtc atcatttgat gcagagtccg ggtttctcta taataaatcc ctttgccaaa 2640 

tgcatgagtt gcagacttgc tactggcaag agtgaagcaa gtgggtgagt aaaactattt 2700 

tgacgtggga gcgttttcag ataggagttt agtcttgacg aaagtgtccg tgcaggaatt 2760 

ggactccgag gagggttaca gtatctcctg acgggacctg ccactcgcat ctgggcaatg 2820 

ttgacatttg aggtggcagg caggatgcct gccttctaat atatttgggt gagtaactga 2880 

gccagccaag ggaaggttga atgattaaat cagaaatggg attcttggta aactgaagac 2940 

ttttatttgg gaatgaaaaa ccttaaaaaa atctcttcat cgttgaactg tgcattttcc 3000 

ctgcattttt tcccaacaaa attttgttgg gggttatgtt actgaagaat gaacagatga 3060 

gtaagtggag gtgttatgta aaggcatatt gtactcgaaa tctgaagacc tgcagcagat 3120 

ttaaattaca actcttgtta taacttttta aaagattgtg aaaatatcaa aatataaatg 3180 

aatcaagttt taatatactg tatgatgggt ggatgaggct gtccattgta ccatttgttt 3240 

gaattctcag gcatggtttg gcagtgcaag aattctgtaa cattaacaaa ttcaataaaa 3300 

agtaaatata tggaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaa 3346 

<210> 6 

<211> 258 

<212> PRT 

<213> Homo Sapiens 



SUBSTITUTE SHEET (RULE 26) 



WO 01/25250 



7 



PCT/US00/27649 



<400> 6 



Met Ala 


Glu 


Thr 


Glu 


Glu Arg 


Ser 


Leu 


Asp 


Asn 


Phe 


Phe 


Ala 


Lys 


Arg 


1 






5 










10 










15 




Asp Lys 


Lys 


Lys 


Lvs 


Lys Glu 


Arg 


Ser 


Asn 


Arg 


Ala 


Ala 


Ser 


Ala 


Ala 






20 










25 










30 






Gly Ala 


Ala 


Gly 


Ser 


Ala Gly 


Gly Ser 


Ser 


Gly Ala 


Ala 


Gly 


Ala 


Ala 




35 










40 










45 








Glv Glv 


Gly 


Ala 


Gly 


Ala Gly 


Thr Arg 


Pro 


Gly Asp 


Glv 


Glv 


Thr 


Ala 


50 










55 










60 










Ser Ala 


Gly 


Ala 


Ala 


Gly Pro 


Gly Ala 


Ala 


Thr 


Lys 


Ala 


Val 


Thr 


Lvs 


65 








70 










75 










80 


Asp Glu 


Asp 


Glu 


Trp 


Lys Glu 


Leu 


Glu 


Gin 


Lys 


Glu 


Val 


Asd 


Tvr 


Ser 








85 










90 










95 




Gly Leu 


Aro 


Val 


Gin 


Ala 


Met 


Gin 


lie 


Ser 


Ser 


Glu 


LVS 


Glu 


Glu 


Asp 






100 










105 










110 




Asp Asn 


Glu 


Lys 


Arg 


Gin Asp 


Pro Gly 


Asp 


Asn Trp 


Glu 


Glu 


Glv 


Glv 




115 










120 










125 








Glv Glv 


Glv 


Gly Gly 


Met 


Glu 


Lys 


Ser 


Ser 


Gly 


Pro 


TrD 


Asn 


Lys 


Thr 


130 










135 










140 










Ala Pro 


Val 


Gin 


Ala 


Pro 


Pro 


Ala 


Pro 


Val 


lie 


Val 


Thr 


Glu 


Thr 


Pro 


145 








150 










155 










160 


Glu Pro 


Ala 


Met 


Thr 


Ser 


Gly 


Val 


Tyr 


Arg 


Pro 


Pro 


Gly 


Ala 


Arg 


Leu 








165 










170 










175 




Thr Thr 


Thr 


Arg 


Lys 


Thr 


Pro 


Gin 


Gly 


Pro 


Pro 


Glu 


lie 


Tyr 


Ser 


Asp 






180 










185 










190 






Thr Gin 


Phe 


Pro 


Ser 


Leu 


Gin 


Ser 


Thr 


Ala 


Lys 


His 


Val 


Glu 


Ser 


Arg 




195 










200 










205 








Lys Asp 


Lys 


Glu 


Met 


Glu 


Lys 


Ser 


Phe 


Glu 


Val 


Val 


Arg 


His 


Lys 


Asn 


210 










215 










220 










Arg Gly 


Arg 


Asp 


Glu 


Val 


Ser 


Lys 


Asn 


Gin 


Ala 


Leu 


Lys 


Leu 


Gin 


Leu 


225 








230 










235 










240 


Asp Asn 


Gin 


Tyr Ala 


Val 


Leu 


Glu 


Asn 


Gin 


Lys 


Ser 


Ser 


His 


Ser 


Gin 



245 250 255 



Tyr Asn 



<210> 7 

<211> 2068 

<212> DNA 

<213> Homo Sapiens 

<400> 7. 

ggcacgaggt ggcatagcat aaccacagta agaacagaac agatattcag cagaaaactt 60 

tttatactct aattcttttt tttttttttt ttgagacaga gttttagtct tgtttcccag 120 

gctggagtgc aatggcacaa tcttggctca ctgcaacctc cgcctcctgg gttcaggcaa 180 

ttttcctgcc tcagcctccc aagtagctgg gattacaggc acccaccacc atgcccagct 240 

aatttttgta tttttaatag agagctaata attgtatatt taataaagac gggtttcacc 300 

atgttggcca ggctggtctt gaactcctga cctcaggtga tcctcctgca ttggcctccc 360 

aaagtgctgg aattccaggc atgagccact gcgcccagtc tacacactaa ttcttgttag 420 

cccaacagct gttctgttct atctacccct catttcacgc tcaaggagtc atacctagaa 480 

tagttacaca caagagggaa actggaagcc aaacactgta cagtattgtg tagaaagtca 540 

cctccctact ccttttattt tacatgagtg ctgatgtgtt ttggcagatg agctttcagc 600 

tgaggcctga tggaaattga gataacctgc aaagacataa cagtatttat gagttatatc 660 

ttagttcttg aaattgtgga atgcatgatt gacaatatat ttttaatttt tattttttca 720 

agtaatacca gtactgttta actatagcca gaactggcta aaatttttat attttcagag 780 

ttgaagttgg tgaagacatt catgatttaa acaccagatc ctgaaagggg ttaaatctac 840 

tttgaaatga atctgcaatc agtatttcaa agcttttctg gtaattttag tgatcttatt 900 

tgattagact ttttcagaag tactaaataa ggaattttaa caggttttta ttaatgcaca 960 

gataaataga agtacagtga ggtctatagc cattttatta aaatagctta aaagtttgta 1020 

aaaaaatgaa tctttgtaat tacttaatat gttagttaag aacccgtcaa gcttatattt 1080 

gctagactta caaattattt taaatgcatt tatctttttt gacactattc agtggaatgt 1140 

gtaagctagc taattcttgt tttctgattt aaagcacttt taaatcttat cctgccccct 1200 

aaaaacaaaa ggttttgatc acaaggggaa atttaagatt gttaaccctg tttttcagaa 1260 

gggctactgt taattgcaca taaacatgaa atgtgttttc ccctgtgtac taacacattc 1320 

taggcaaaat tcaaacttat agtggtaaag aaacaggttg ttcacttgct gaggtgcaaa 1380 

aattcttaag acttctgttt gaaattgctc aatgactagg aaaagatgta gtagtttact 1440 

aaaattgttt ttctaccata tcaaattaaa caattcatgc ctttttaggg tcaggcctac 1500 

aatgaatagg tatggtggtt tcacagaatt ttaaaataga gttaaaggga agtgatgtac 1560 
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atttcggggg cattagggta gggagatgaa tcaaaaaata cccctagtaa tgctttatat 1620 

tttaatactg caaaagcttt acaaatggaa accatgcaat tacctgcctt agttcttttg 1680 

tcataaaaac aatcacttgg ttggttgtat tgtagctatt acttatacag caacatttct 1740 

tcaattagca gtctagacat tttataaaca gaaatcttgg accaattgat aatatttctg 1800 

actgtattaa tattttagtg ctataaaata ctatgtgaat ctcttaaaaa tctgacattt 1860 

tacagtctgv: attagacata ctgtttttat aatgttttac ttctgcctta agatttaggt 1920 

tttttaaatg tatttttgcc ctgaattaag tgttaatttg atggaaactc tgcttttaaa 1980 

atcatcattt actgggttct aataaattaa aaattaaact tgtaaaaaaa aaaaaaaaaa 2040 

aaaaaaaaaa aaaaaaaaaa aaaaaaaa 2068 

<210> 8 

<2U> 1027 

<212> DNA 

<213> Homo Sapiens 

<400> 8 

ggcacgaggt cccaccttgt ggaggatgga ggtgaccggg gacgccgggg taccagaatc 60 

tggcgagatc cggactctaa agccgtgtct gctgcgccgc aactacagcc gcgaacagca 120 

cggcgtggcc gcctcctgcc tcgaagacct gaggagcaag gcctgtgaca ttctggccat 180 

tgataagtcc ctgacaccag tcaccctggt cctggcagag gatggcacca tagtggatga 240 

tgacgattac tttctgtgtc taccttccaa tactaagttt gtggcattgg ctagtaatga 300 

gaaatgggca tacaacaatt cagatggagg tacagcttgg atttcccaag agtcctttga 360 

tgtagatgaa acagacagcg gggcagggtt gaagtggaag aatgtggcca ggcagctgaa 420 

agaagatctg tccagcatca tcctcctatc agaggaggac ctccagatgc ttgttgacgc 480 

tccctgctca gacctggctc aggaactacg tcagagttgt gccaccgtcc agcggctgca 540 

gcacacactc caacaggtgc ttgaccaaag agaggaagtg cgtcagtcca agcagctcct 600 

gcagctgtac ctccaggctt tggagaaaga gggcagcctc ttgtcaaagc aggaagagtc 660 

caaagctgcc tttggtgagg aggtggatgc agtagacacg ggtatcagca gagagacctc 720 

ctcggacgtt gcgctggcga gccacatcct tactgcactg agggagaagc aggctccaga 780 

gctgagctta tctagtcagg atttggaggt gggcggaaac cagggtcact gagctacaga 840 

ggaggacatg ccctgggatg tagtagtatc atgcagaggt gtgtgggccc ttttgttcac 900 

ctctgcagac tgtgaatcct agctgccagt ttgcctatta tatgccaagg catttgcaaa 960 

aatctcatta atctaaatca aaatagcttt aaagaaaaat gcaaaaaaaa aaaaaaaaaa 1020 

aaaaaaa 1027 

<210> 9 
<211> 268 
-<212> PRT 

<213> Homo Sapiens 



<400> 9 



Met 


Glu 


Val 


Thr 


Gly 


Asp 


Ala 


Gly 


Val 


Pro 


Glu 


Ser 


Gly 


Glu 


He 


Arg 


1 








5 










10 










15 




Thr 


Leu 


Lys 


Pro 


Cys 


Leu 


Leu 


Arg 


Arg 


Asn 


Tyr 


Ser 


Arg 


Glu 


Gin 


His 








20 










25 










30 






Gly 


Val. 


Ala 


Ala 


Ser 


Cys 


Leu 


Glu 


Asp 


Leu 


Arg 


Ser 


Lys 


Ala 


Cys 


Asp 






35 










40 










45 








He 


Leu 


Ala 


He 


Asp 


Lys 


Ser 


Leu 


Thr 


Pro 


Val 


Thr 


Leu 


Val 


Leu 


Ala 




50 










55 










60 










Glu 


Asp 


Gly 


Thr 


He 


Val 


Asp 


Asp 


Asp 


Asp 


Tyr 


Phe 


Leu 


Cys 


Leu 


Pro 


65 










70 










75 










80 


Ser 


Asn 


Thr 


Lys 


Phe 


Val 


Ala 


Leu 


Ala 


Ser 


Asn 


Glu 


Lys 


Trp 


Ala 


Tyr 










85 










90 










95 




Asn 


Asn 


Ser 


Asp 


Gly 


Gly 


Thr 


Ala 


Trp 


He 


Ser 


Gin 


Glu 


Ser 


Phe 


Asp 








100 










105 










110 






val 


Asp 


Glu 


Thr 


Asp 


Ser 


Gly 


Ala 


Gly 


Leu 


Lys 


Trp 


Lys 


Asn 


Val 


Ala 






115 










120 










125 








Arg 


Gin 


Leu 


Lys 


Glu 


Asp 


Leu 


Ser 


Ser 


He 


He 


Leu 


Leu 


Ser 


Glu 


Glu 


130 










135 










140 










Asp 


Leu 


Gin 


Met 


Leu 


Val 


Asp 


Ala 


Pro 


Cys 


Ser 


Asp 


Leu 


Ala 


Gin 


Glu 


145 










150 










155 










160 


Leu 


Arg 


Gin 


Ser 


Cys 


Ala 


Thr 


Val 


Gin 


Arg 


Leu 


Gin 


His 


Thr 


Leu 


Gin 










165 










170 










175 




Gin 


Val 


Leu 


Asp 


Gin 


Arg 


Glu 


Glu 


Val 


Arg 


Gin 


Ser 


Lys 


Gin 


Leu 


Leu 








180 










185 










190 






Gin 


Leu 


Tyr 


Leu 


Gin 


Ala 


Leu 


Glu 


Lys 


Glu 


Gly 


Ser 


Leu 


Leu 


Ser 


Lys 






195 










200 










205 








Gin 


Glu 


Glu 


Ser 


Lys 


Ala 


Ala 


Phe 


Gly 


Glu 


Glu 


Val 


Asp 


Ala 


Val 


Asp 
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210 215 220 

Thr Gly. lie Ser Arg Glu Thr Ser Ser Asp Val Ala Leu Ala Ser His 
225 230 235 240 

lie Leu Thr Ala Leu Arg Glu Lys Gin Ala Pro Glu Leu Ser Leu Ser 

245 250 255 

Ser Gin Asp Leu Glu Val Gly Gly Asn Gin Gly His 
260 265 

<210> 10 

<211> 2214 

<212> DNA 

<213> Homo Sapiens 



<400> 10 

ggcacgaggc 

ttactgtaga 

gggaagaaga 

agaagaccgt 

tgacaagagt 

ttgctcaaaa 

gaga ttt tar; 

acagcatcga 

ggcctgaact 

atacacatat 

atacatgtgt 

atacacatat 

aaaactgtag 

ttttattgcc 

tcaaataaaa 

tttttttgac 

cactgcaacc 

gggattacag 

tttcactatg 

ctcccaaagt 

ctctttttaa 

taattttact 

cagtaatatg 

gatctgtggt 

ctgttggtag 

ttaaaaccat 

ttagaagaaa 

tttataataa 

tggtgcattg 

tttggaagca 

gtcatcatct 

taaattatgt 

tatagtcctg 

tgatgtattt 

ggtgttttaa 

tcgccatgca 

gtgtagtaaa 



taaatgtaga 
aaattcaccc 
ggagaagaaa 
accacaaaag 
atgtggcctt 
attctcctgt 
agatgacata 
agatcttgga 
gagagttgat 
atatgtatat 
gtatgtatgc 
atgtatacat 
actgtcctcg 
.aaagtcaaat 
aattttggtc 
agagtctcgc 
tccgcctccc 
gcgcccgcca 
ttggccaggc 
gctggtatta 
ctttattttt 
gatctaaagc 
tctccttgct 
tgaatttgct 
gagttgtttg 
gtatgtgcgc 
taatcagcct 
aaaattgcaa 
tgtggtagaa 
tgtttggtaa 
gctctgagtg 
gagtaaaatt 
jaaa tagcaa 
gatgaattta 
gttgctgtgg 
aattacaatc 
gatgtaaaat 



caatggttag 
aaacaagaag 
aaacagaaga 
gttactatag 
gcaacttttg 
ggtgcctcag 
attgatgtca 
gaagtaaaga 
atggccaaag 
atacacatat 
atgtatatac 
atatacacat 
tccttggcat 
aaacgggaga 
atttggtact 
actgttgcct 
gggttcaagt 
ccacgcccag 
tggtctcgaa 
cagatgtgag 
tgaaaaaccc 
tgagtgattt 
tctttgatgt 
gtgttgttat 
agctattctg 
tgtttgaacc 
tgcataaaac 
atgttataaa 
tattatgcat 
tgtcaagtgg 
gaaggccgtt 
gtgctaacca 
ttgaaacatg 
aactttaagt 
acacttttaa 
ttgaatgagt 
taaaaaaaaa 



agaagaattt 
ctggaattag 
gaggtggaag 
ccaaaattcc 
aaattgatct 
taacagggga 
ttcaggaaaa 
agtgaatttg 
ggagagaggc 
atgtatgtat 
atatatacat 
atatgtatac 
tttcactgtt 
ctgtcatgct 
gactttctct 
gggctggaat 
gattctcctg 
ctaatttttg 
ctcctgacct 
ccaccgcacc 
ggtagacttt 
tttaaaagaa 
gatagttttg 
gaagtccacc 
gagattattt 
agtaagccac 
ttatggatga 
tgaacaattg 
atgtttaaag 
agtaccccag 
cagagaggct 
gttaaaagta 
tcttctcaca 
caggtgctgc 
gaaacttaga 
gtttttttaa 
aaaaaaaaaa 



tccaaatgaa 


tttgcaaaac 


60 


tgagggtcaa 


ggaacagcag 


120 


gggtcaaata 


aaacaaaaaa 


180 


cagagcaaag 


aagaaatatg 


240 


taaagaagca 


caaagatttt 


300 


ggatgaaatt 


atcattcagg 


360 


atggccagag 


gtagatgatg 


420 


aaaatttgtc 


tgtatttaat 


480 


cttttaaaat 


atatatatat 


540 


acacatatac 


acatgtatat 


600 


acacatatat 


gtatacatat 


660 


atatatatat 


attctacagt 


720 


ctgtacaagg 


ctgcttgttt 


780 


catgcatgaa 


tagaatttag 


840 


ctctctctct 


ctctcttttt 


900 


gcagtggtgc 


gatctcggct 


960 


cctcagcctc 


ccaagtaggt 


1020 


tatttttagt 


agagatgggg 


1080 


cgtgattggc 


ccacctcagc 


1140 


cagcctgagt 


ttctctttct 


1200 


gtggggagca 


tttttgttga 


1260 


tttgaatttg 


gcttcctcac 


1320 


agatgggtga 


gaatctaata 


1380 


ctgtgggcac 


aataacataa 


1440 


ggtaaagtat 


actaaaagcc 


1500 


ttctttgaca 


ttagaagaca 


1560 


aagtattcat 


cacaatatta 


1620 


ggaaatggtt 


aaagaagtga 


1680 


aatcatattt 


tctaagatta 


1740 


atacatttta 


gacatttatc 


1800 


agaggttctt 


attctggcta 


1860 


ctgtacaccc 


atgctcaata 


1920 


agagaaaatg 


acagttttaa 


1980 


aaattggaaa 


gaagacttgt 


2040 


acccatggaa 


cccttgttta 


2100 


aaataaagta 


ttagaaaaat 


2160 


aaaaaaaaaa 


aaaa 


2214 



<210> 11 
<211> 150 
<212> PRT 

<213> Homo Sapiens 
<400> 11 

His Glu Ala Lys Cys Arg Gin Trp Leu Glu Lys Asn Phe Pro Asn Glu 

1 5 10 15 

Phe Ala Lys Leu Thr Val Glu Asn Ser Pro Lys Gin Glu Ala Gly lie 

20 25 30 

Ser Glu Gly Gin Gly Thr Ala Gly Glu Glu Glu Glu Lys Lys Lys Gin 

35 40 45 

Lys Arg Gly Gly Arg Gly Gin lie Lys Gin Lys Lys Lys Thr Val Pro 

50 55 60 

Gin Lys Val Thr lie Ala Lys lie Pro Arg Ala Lys Lys Lys Tyr Val 
65 70 75 80 



SUBSTITUTE SHEET (RULE 26) 



WO 01/25250 

10 

Thr Arg Val Cys Gly Leu Ala Thr Phe Glu 

85 90 
Gin Arg Phe Phe Ala Gin Lys Phe Ser Cys 

100 105 
Glu Asp Glu lie He lie Gin Gly Asp Phe 

115 120 
Val He Gin Glu Lys Trp Pro Glu Val Asp 

130 135 
Leu Gly Glu Val Lys Lys 
145 150 
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He Asp Leu Lys Glu Ala 
95 

Gly Ala Ser Val Thr Gly 
110 

Thr Asp Asp He He Asp 
125 

Asp Asp Ser He Glu Asp 
140 



<210> 12 
<211> 433 
<212> PRT 
. <213> Caenorhabditis Elegans 



<400> 12 



Pro 




Arg 


Ala 


Glu 


He 


Val 


He 


He Gly Gly Gly 


Leu Ser Gly Ser 


1 








5 








10 


15 


Ser 


Thr 


Ala 


Phe 


Tm 


Leu 


Lys 


Glu 


Arg Phe Arg Asp Glu Asp Phe Lys 








20 










25 


30 


Val 


Val 


Val 


Val 


Glu 


Asn 


Asn 


Asp 


Val Phe Thr Lys 


Ser Ser Thr Met 






35 










40 




45 


Leu 


Ser 


Thr 


Glv 


Glv 


He 


Thr 


Gin 


Gin Phe Ser He 


Pro Glu Phe Val 




50 










55 




60 




Asp 


Met 


Ser 


Leu 


Phe 


Thr 


Thr 


Glu 


Phe Leu Arg His 


Ala Gly Glu His 


65 










70 






75 


80 


Leu 


Arg 


He 


Leu 


ASD 


Ser 


Glu 


Gin 


Pro Asp He Asn 


Phe Phe Pro Thr 










85 








90 


95 


Glv 


Tvr 


Leu 


Arg 


Leu 


Ala 


Lys 


Thr Asp Glu Glu Val Glu Met Met Arg 








100 










105 


110 


Ser 


Ala 


TrD 


Lys 


Val 


Gin 


lie 


Glu Arg Gly Ala Lys 


Val Gin Leu Leu 






115 










120 




125 


Ser 


Lvs 


Asp 


Glu 


Leu 


Thr 


Lys 


Arg 


Tyr Pro Tyr Met Asn Val Asp Asp 




130 










135 




140 




Val 


Leu 


Leu 


Ala 


Ser 


Leu 


Gly 


Val 


Glu Asn Glu Gly Thr He Asp Thr 


145 










150 






155 


160 


Tro 


Gin 


Leu 


Leu 


Ser 


Ala 


He 


Arg 


Glu Lys Asn He 


Thr Leu Gly Val 










165 








170 


175 


Gin 


Tyr 


Val 


Lys 


Gly 


Glu 


val 


Glu 


Gly Phe Gin Phe 


Glu Arg His Arg 








180 










185 


190 


Ala 


Ser 


Ser 


Glu 


Val 


His 


Ala 


Phe 


Gly Asp Asp Ala 


Thr Ala Asp Glu 






195 










200 




205 


Asn 


Lys 


Leu 


Arg 


Ala 


Gin 


Arg 


He 


Ser Gly Val Leu 


Val Arg Pro Gin 




210 










215 




220 




Met 


Asn 


Asp 


Ala 


Ser 


Ala 


Arg 


Pro 


He Arg Ala His 


Leu He Val Asn 


225 










230 






235 


240 


Ala 


Ala 


Gly 


Pro 


Trp 


Ala 


Gly 


Gin 


Val Ala Lys Met 


Ala Gly He Gly 










245 








250 


255 


Lys 


Gly 


Thr 


Gly 


Leu 


Leu 


Ala 


Val 


Pro Val Pro He 


Gin Pro Arg Lys 








260 










265 


270 


Arg 


Asp 


Val 


Phe 


Val 


He 


Phe 


Ala 


Pro Asp Val Pro 


Ser Asp Leu Pro 






275 










280 




285 


Phe 


He 


He 


Asp 


Pro 


Ser 


Thr 


Gly 


Val Phe Cys Arg Gin Thr Asp Ser 




290 










295 




300 




Gly 


Gin 


Thr 


Phe 


Leu 


Val 


Gly 


Arg 


Thr Pro Ser Lys Glu Glu Asp Ala 


305 










310 






315 


320 


Lys 


Arg 


Asp 


His 


Ser 


Asn 


Leu 


Asp Val Asp Tyr Asp Asp Phe Tyr Gin 










325 








330 


335 


Lys 


He 


Trp 


Pro 


Val 


Leu 


Val 


Asp Arg Val Pro Gly Phe Gin Thr Ala 








340 










345 


350 


Lys 


Val 


Lys 


Ser 


Ala 


Trp 


Ser 


Gly Tyr Gin Asp He Asn Thr Phe Asp 






355 










360 




365 


Asp 


Ala 


Pro 


Val 


lie 


Gly 


Glu 


His 


Pro Leu Tyr Thr Asn Leu His Met 




370 










375 




380 




Met 


Cys 


Gly 


Phe 


Gly 


Glu 


Arg 


Gly Val Met His Ser 


Met Ala Ala Ala 


385 










390 






395 


400 


Arg 


Ala 


Tyr 


Ala 


Glu 


Arg 


He 


Phe Asp Gly Ala Tyr He Asn Val Asn 
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405 410 415 

Leu Arg Lys Phe Asp Met Arg Arg lie Val Lys Met Asp Pro He Thr 
420 425 430 

Glu 



<210> 13 

<211> 223 

<212> PRT 

<213> Homo Sapiens 



<400> 13 



Met 


Lys 


Val Ala Val 


lie Gly Gly 


1 




5 






Tyr 


Phe 


Leu Ala Lys 


Ala Gly Ala 




20 






Tyr 


Leu 


Leu Tyr Gly 


Ala 


Ser Gly 




35 




40 


Gin 


Phe 


Thr Asn Glu 


Ala 


Met He 




50 






55 


Leu 


Tyr 


Asp Glu Leu 


Gin 


Ser Glu 


65 






70 




Arg 


Asp 


Gly Tyr Val 


Lys 


He Ala 






85 






Arg 


Glu 


Glu Val Glu 


Phe 


Gin Arg 




100 






Val 


Glu 


Pro Glu Phe 


val 


Lys Glu 






115 




120 


Ala 


Phe 


Thr Ala Ala 


Ser 


Tyr Phe 




130 






135 


Trp 


Pro 


Val Val Trp 


Gly Leu Ala 


145 






150 




Glu 


He 


Tyr Asp Tyr 


Thr 


Pro Ala 






165 






Leu 


Thr 


Val Lys Ala 


Ser Gly Glu 






180 






Asn 


Ala 


Ala Gly Ala 


Trp Ser Asn 






195 




200 


Glu 


Leu 


Asn Asn Lys 


Val 


Phe Arg 




210 






215 



Gly Val Ala Gly Leu Ser Ala Ala 

10 15 
Asp Val Lys Val Phe Glu Gin Lys 
25 30 
Arg Asn Ser Gly Gly Leu Thr Ala 
45* 

Lys Leu Ala Lys Arg Thr Leu Glu 
60 

Val Gly Phe Asn Phe Leu Leu Arg 

75 80 
Gly Lys Gly Glu Glu Ala Lys Leu 

90 95 
Lys Ala Gly Val Lys Val Lys Met 
105 110 
Leu Phe Pro Asp He Asn Thr Ser 
.125 

Ala Asp Gly Gly Val Val Phe Pro 
140 

Lys Gly Cys Arg Glu Leu Gly Val 
155 160 
Ser Val Glu Val Lys Gly Asn Asp 

170 175 
Ser Tyr Lys Val Asp Tyr He He 
185 . 190 

Glu He Ser Gin Gin Ala Gly Val 
205 

Glu Glu He Cys Val Thr Glu 
220 



<210> 14 
<211> 175 
<212> PRT 

<213> Drosophila Melanogaster 



<400> 14 



Leu 


Asp 


Asp 


Phe 


Phe 


Ala 


Lys 


Lys Asp 


Lys Lys Lys Ser Lys Asn Lys 


1 






5 








10 15 


Thr 


Lys 


Phe 


Val 


Thr 


Ala 


Asp 


Glu Met 


Val Lys Asn Leu Glu Asp Gly 






20 








25 


30 


Thr 


Lys 


Arg 


Glu 


Val 


Val 


Lys 


Pro Lys 


Lys Pro Glu Ala Ala Ala Gly 




35 










40 


45 


Gly 


Val 


Ala 


Val 


Val 


Gly 


Glu 


Asn Glu 


Asn Ser Gly Thr Lys Val Pro 


50 










55 




60 


Glu 


Ser 


Ala 


Pro 


Pro 


Val 


Glu 


Glu Glu 


Trp Lys Glu Phe Glu Glu Glu 


65 










70 






75 80 


Gin 


Arg 


Lys 


Asp 


Tyr 


Ser 


Gly 


Leu Lys 


He Gly Gin Leu Ser Thr He 








85 








90 95 


Thr 


Ala 


Gin 


Glu 


Ser 


Ala 


Glu 


Ser Arg 


Ala Thr Arg Val Pro Thr Ala 








100 








105 


110 


Gin 


Asp 


Gly 


Gly 


Asn 


Tyr 


Asp 


Glu Asp 


Asp Glu Asp Ser Asn Gly Tyr 






115 










120 


125 


Asp 


Asn 


Ala 


Asp 


Val 


Asn 


Lys 


Glu Arg 


Val Gly His Gly Pro Trp Lys 


130 










135 




140 


Lys 


Val 


Val 


Pro 


Ala 


Glu 


Glu 


Val Met 


Gin He Pro Val Pro Val Glu 


145 










150 






155 160 


Val 


Glu 


Lys 


Pro 


Ser 


Ser 


Lys 


Thr Tyr 


Val Ser Pro Ala Leu Arg 
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